Reddit links
Agent Source token | 93df90e8-1881-40fc-b19d-49d78cc9ee24 |
Consumes Artifacts | subreddit-list |
Subject coverage | All links posted in subreddits on the subreddit-list. |
Object coverage | All DOIs, all article landing pages. |
Data contributor | |
Data origin | Subreddit feeds and websites that they link to. |
Freshness | Every few hours. |
Identifies | Linked DOIs, unlinked DOIs, Landing Page URLs |
License | Creative Commons CC0 1.0 Universal (CC0 1.0) |
Looks in | The text of webpages linked to from the specified subreddits. |
Name | Reddit Links |
Operated by | Crossref |
Produces Evidence Records | Yes |
Produces relation types | discusses |
Source ID | reddit-links |
Updates or deletions | None expected |
What it is
Users share and discuss links on Reddit. The Reddit Links source looks at the links that are shared in a specific selection of subreddits (Reddit discussion forums) and visits the webpages that are linked. The selection of subreddits to visit is specified in the subreddit-list Artifact. The subreddits tend to be scientific or academic in focus.
This is different to the Reddit source, which looks at the discussions themselves.
What it does
- Visits each subreddit in turn.
- Fetches all links that were shared since the last visit.
- Visits each link and looks in the HTML of the webpage for links to DOIs, links to article landing pages and unlinked DOIs.
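The last step can be sketched as follows. This is a minimal illustration using only the Python standard library, not the Agent's actual code: the DOI pattern is a common heuristic, and the names `DoiScanner` and `scan_html` are hypothetical. Detecting article landing pages is left out because it depends on knowing which domains belong to publishers.

```python
import re
from html.parser import HTMLParser

# Heuristic DOI pattern (an assumption, not the pattern the Agent uses).
DOI_RE = re.compile(r'10\.\d{4,9}/[^\s"<>]+')
# "doi.org" also matches "dx.doi.org" when used as a substring test.
DOI_HOST = "doi.org"


class DoiScanner(HTMLParser):
    """Collects DOIs found in <a href> values and in plain page text."""

    def __init__(self):
        super().__init__()
        self.linked = set()    # DOIs that appear as hyperlinks
        self.unlinked = set()  # DOIs that appear in text content

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href") or ""
        if DOI_HOST in href:
            match = DOI_RE.search(href)
            if match:
                self.linked.add(match.group(0))

    def handle_data(self, data):
        for match in DOI_RE.finditer(data):
            # Drop sentence punctuation that trails the DOI.
            self.unlinked.add(match.group(0).rstrip(".,;)"))


def scan_html(html):
    """Return (linked DOIs, unlinked DOIs) found in an HTML string."""
    scanner = DoiScanner()
    scanner.feed(html)
    scanner.close()
    # A DOI that is also hyperlinked is reported as linked only.
    return scanner.linked, scanner.unlinked - scanner.linked
```

Calling `scan_html(page_source)` on the HTML of a fetched page returns two sets, one for each kind of DOI mention.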
Example Event
{
  "license": "https://creativecommons.org/publicdomain/zero/1.0/",
  "obj_id": "https://0-doi-org.libus.csd.mu.edu/10.1126/science.1182238",
  "source_token": "93df90e8-1881-40fc-b19d-49d78cc9ee24",
  "occurred_at": "2016-07-16T05:33:05Z",
  "subj_id": "http://0-rsos-royalsocietypublishing-org.libus.csd.mu.edu/content/3/7/160131",
  "id": "0004a308-b218-47f5-bf58-82bc2c245bc7",
  "evidence_record": "https://0-evidence-eventdata-crossref-org.libus.csd.mu.edu/evidence/20170410-reddit-links-fd42bae4-aa51-4cd8-a022-3b3c3e501949",
  "terms": "https://0-doi-org.libus.csd.mu.edu/10.13003/CED-terms-of-use",
  "action": "add",
  "subj": {
    "pid": "http://0-rsos-royalsocietypublishing-org.libus.csd.mu.edu/content/3/7/160131"
  },
  "source_id": "reddit-links",
  "obj": {
    "pid": "https://0-doi-org.libus.csd.mu.edu/10.1126/science.1182238",
    "url": "http://0-dx-doi-org.libus.csd.mu.edu/10.1126/science.1182238"
  },
  "timestamp": "2017-04-10T16:35:52Z",
  "relation_type_id": "discusses"
}
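A consumer might read an Event like the one above as a subject, relation, object triple. The sketch below uses a trimmed copy of the example with the URLs exactly as they appear in it; the helper names `parse_utc`, `as_triple` and `recorded_after` are hypothetical, and treating the difference between `timestamp` and `occurred_at` as a recording delay is an assumption about what the two fields mean.

```python
import json
from datetime import datetime, timezone

# Trimmed copy of the example Event; only the fields read below are kept.
EVENT_JSON = """
{
  "subj_id": "http://0-rsos-royalsocietypublishing-org.libus.csd.mu.edu/content/3/7/160131",
  "relation_type_id": "discusses",
  "obj_id": "https://0-doi-org.libus.csd.mu.edu/10.1126/science.1182238",
  "source_id": "reddit-links",
  "occurred_at": "2016-07-16T05:33:05Z",
  "timestamp": "2017-04-10T16:35:52Z"
}
"""


def parse_utc(value):
    """Parse the 'YYYY-MM-DDTHH:MM:SSZ' form used in the example Event."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%SZ")
    return parsed.replace(tzinfo=timezone.utc)


def as_triple(event):
    """Return (subject, relation, object) for one Event."""
    return event["subj_id"], event["relation_type_id"], event["obj_id"]


def recorded_after(event):
    """Time between 'occurred_at' and 'timestamp' (assumed recording delay)."""
    return parse_utc(event["timestamp"]) - parse_utc(event["occurred_at"])


event = json.loads(EVENT_JSON)
subject, relation, obj = as_triple(event)
print(subject, relation, obj)
print(recorded_after(event))
```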
Evidence Record
- Includes batches of landing-page-url type observations.
Edits / deletion
We don't expect to have to edit or delete any Events.
Quirks
We will only visit subreddits on the list. However, we monitor the Events generated by the Reddit source and manually review them.
Failure modes
- Reddit API may be unavailable.
- Publisher sites may block the Event Data Bot from collecting landing pages.
- Publisher sites may use robots.txt to prevent the Event Data Bot from collecting landing pages.
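The robots.txt case can be illustrated with Python's standard `urllib.robotparser` module. This is only a sketch of how a well-behaved crawler decides whether it may fetch a page: the user-agent string `ExampleEventDataBot`, the host `publisher.example`, the robots.txt content and the helper `may_fetch` are all placeholders, not details of the real Event Data Bot or of any publisher.

```python
from urllib.robotparser import RobotFileParser

# Placeholder user-agent token; the real bot's identifier is not given here.
USER_AGENT = "ExampleEventDataBot"

# Hypothetical robots.txt that bars this bot from article pages only.
ROBOTS_TXT = """\
User-agent: ExampleEventDataBot
Disallow: /articles/

User-agent: *
Disallow:
"""


def may_fetch(robots_txt, url, user_agent=USER_AGENT):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes the file's lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


for url in (
    "https://publisher.example/articles/123",
    "https://publisher.example/about",
):
    print(url, may_fetch(ROBOTS_TXT, url))
```

With rules like these, landing pages under `/articles/` would be skipped, so no Events could be produced for links that point to them.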