An RSS reader sends periodic requests to get the latest feed. This includes a User-Agent field, identifying which fetcher is running:
Feedbin feed-id:1242010 - 38 subscribersThis fetcher is nicely passing along statistics, saying how many readers it represents.
I took one day of logs, with 5,962 requests for my RSS feed:
$ sudo grep '"GET /news.rss ' \ /var/log/nginx/access.log.1 \ | awk -F'"' '{print $6}' \ | wc -l 5962There were 162 unique User-Agents:
$ sudo grep '"GET /news.rss ' \ /var/log/nginx/access.log.1 \ | awk -F'"' '{print $6}' \ | sort \ | uniq \ | wc -l 162Of the 5,962 requests, 932 (16%) gave stats:
$ sudo grep '"GET /news.rss ' \ /var/log/nginx/access.log.1 \ | awk -F'"' '{print $6}' \ | grep 'subscriber\|reader' \ | wc -l 932They sent 21 distinct User-Agents:
$ sudo grep '"GET /news.rss ' \ /var/log/nginx/access.log.1 \ | awk -F'"' '{print $6}' \ | grep 'subscriber\|reader' \ | sort \ | uniq \ | wc -l 21Some sent multiple requests with different numbers of subscribers:
Feedbin feed-id:1242010 - 38 subscribers Feedbin feed-id:372940 - 11 subscribers Feedbin feed-id:382 - 1 subscribersI suspect this comes from people using old URLs that then get redirected to my current URL. For example, now it's
https://www.jefftk.com/news.rss
, but it used to be
http://www.jefftk.com/news.rss
, and even longer ago it
was an
sccs.swarthmore.edu
address. Summing subscriber
counts, I see:
- Feedly: 573
- inoreader.com: 87
- NewsBlur: 62
- Feedbin: 50
- theoldreader.com: 34
- Dreamwidth Studios: 7
- BazQux: 5
- Bloglovin: 2
- Feed Wrangler: 2
- pine.blog: 1
Different services fetched at different intervals. Taking the shortest interval for each distinct User-Agent:
- Feedly: 7min
- Feedbin: 15min
- Bloglovin: 30min
- Dreamwidth Studios: 30min
- Feed Wrangler: 30min
- NewsBlur: 30min
- BazQux: 40min
- inoreader.com: 1hr
- theoldreader.com: 2hr
- pine.blog: 24hr
I use QuietRSS which sets its User-Agent to "Mozilla/5.0 (Windows NT 6.1) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/77.0.3865.120 Safari/537.36"
Source: https://github.com/QuiteRSS/quiterss/blob/271c55756dcf19ca163a603a54d12f165548a602/src/main/globals.cpp#L98
Masquerading as Chrome is a mildly inconsiderate choice for an RSS reader to make, especially in not including a token for their own site. User Agent strings for visiting websites are a mess because of a history of people coding only to the dominant browser, but RSS does not have that history.
You do see things like Feedly using
Feedly/1.0 (+http://www.feedly.com/fetcher.html; 452 subscribers; like FeedFetcher-Google)
, where they include theFeedFetcher-Google
token, but there's really no reason to pretend to be a browser.Looks like QuiteRSS has pretended ... (read more)