Details
-
Sub-task
-
Status: Resolved
-
Major
-
Resolution: Not A Problem
-
None
-
None
-
None
-
None
Description
On lucene.apache.org we use Google Analytics tracking
GOOGLE_ANALYTICS_TRACKING_ID = 'UA-94576-12'
I think the reason was so that we could estimate downloads from mirrors, by counting number of clicks on the links from download pages. But are anyone ever looking at or publishing those numbers?
The ASF wants projects to stop using 3rd party tracking of users and instead ask INFRA for aggregated stats for the page. WDYT? Should we
- Remove trackers from both sites and rely on stats from infra
- Continue using Google analytics, but have someone actually publish numbers from it every month?
- Use some other way of counting downloads?
What do we get without a tracker?
INFRA provides anonymous page view stats here https://uls.apache.org/exports/lucene.apache.org.yaml which gives some insight. But not downloads specifically. We see 12k visits to Solr downloads page last months, but we don't know how many of those clicked...
Sheet3: Name: Most visited pages, past month Values: /solr/index.html: 33604 /index.html: 27588 /solr/downloads.html: 12118 /core/2_9_4/queryparsersyntax.html: 11135 /core/index.html: 10353 /solr/guide/solr-tutorial.html: 9734 /solr/resources.html: 8014 /solr/features.html: 7046 /solr/guide/8_8/solr-tutorial.html: 6099 /solr/news.html: 5843 /solr/guide/6_6/the-standard-query-parser.html: 5216 /solr/guide/index.html: 4430 /solr/guide/6_6/common-query-parameters.html: 4379 /core/downloads.html: 3644
There's an interesting section at the bottom of that YAML page, wonder if it could be enabled in some way
Sheet6: Name: Downloads, past month Values: {}
Attachments
Attachments
Issue Links
- relates to
-
LUCENE-9858 Remove Google Analytics from Lucene site
- Open
-
SOLR-15275 Remove Google Analytics from Solr site
- Resolved
- mentioned in
-
Page Loading...
Google Analytics is a lot more than what INFRA provides, though that's useful too. It can do breakdown reports, such as referrers to specific pages, etc. This would allow figuring out if somebody refers to old pages and try to fix things. I wish we had Google Analytics on RefGuide as well, for the same reasons.
But that's only useful if somebody actually has access to Google Analytics and does something with it. I work a tiny bit with GA, would be happy to have a look at the account and produce a couple of reports (maybe shared to committers or PMC only). Then, we can decide whether it is worth keeping it.
There are also non GA ways to track analytics, e.g. https://matomo.org/ which is free on premise. But that needs INFRA to own it.