The NYT has a public API that can be used to track some so-called "stealth edits". Full text is not available, but the API has endpoints that provide headlines, abstracts, lead paragraphs, and article word counts.
Everything should work, with one known bug: some headlines that do not appear to have changed produce different MD5 hashes and end up duplicated in the database. I will fix that at some point.
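The cause of the duplicate headlines is not known. One possible explanation, and it is only a guess, is that the headline text differs in ways that are invisible on screen (non-breaking spaces, curly versus straight quotes, trailing whitespace). A minimal sketch of normalizing a headline before hashing it, where `headline_fingerprint` is a hypothetical helper and not part of the tracker:

```python
import hashlib
import unicodedata

# Curly quotes mapped to their straight equivalents (an assumed source of noise).
_QUOTES = str.maketrans({"\u2018": "'", "\u2019": "'", "\u201c": '"', "\u201d": '"'})


def headline_fingerprint(headline: str) -> str:
    """Return the MD5 hex digest of a normalized headline (hypothetical helper)."""
    # NFKC folds compatibility characters, e.g. a non-breaking space becomes a plain space.
    text = unicodedata.normalize("NFKC", headline)
    # Replace curly quotes with straight ones.
    text = text.translate(_QUOTES)
    # Collapse runs of whitespace and strip the ends.
    text = " ".join(text.split())
    return hashlib.md5(text.encode("utf-8")).hexdigest()
```

If the duplicates persist after normalization, the difference is somewhere else and comparing the raw bytes of the two stored headlines would be the next thing to check.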
- Why are some articles/edits missing?
- The tracker uses the Archive endpoint, which is only updated three times per day (around 03:30, 11:30, and 19:30 PT). An article can be published and edited between two updates, so the tracker never sees the intermediate versions. If you do not like this, build your own. It takes like 15 minutes.
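For anyone building their own, here is a rough sketch of pulling one month from the Archive endpoint and keeping only the fields worth tracking. The URL pattern and the response field names (`_id`, `web_url`, `headline.main`, and so on) are taken from the public NYT developer docs as I understand them and should be checked there; the function names and the output keys are made up for this example, and this is not the tracker's actual code.

```python
import json
import urllib.request

# Assumed URL pattern for the Archive endpoint; verify against the NYT developer docs.
ARCHIVE_URL = "https://api.nytimes.com/svc/archive/v1/{year}/{month}.json?api-key={key}"


def archive_url(year: int, month: int, api_key: str) -> str:
    """Build the request URL for one month of archive data."""
    return ARCHIVE_URL.format(year=year, month=month, key=api_key)


def fetch_archive(year: int, month: int, api_key: str) -> list:
    """Download one month and return the list of article documents."""
    with urllib.request.urlopen(archive_url(year, month, api_key), timeout=60) as resp:
        payload = json.load(resp)
    return payload["response"]["docs"]


def extract_fields(doc: dict) -> dict:
    """Keep only the fields needed to detect edits (field names are assumptions)."""
    return {
        "article_id": doc.get("_id"),
        "pub_date": doc.get("pub_date"),
        "section_name": doc.get("section_name"),
        "document_type": doc.get("document_type"),
        "web_uri": doc.get("web_url"),
        "headline": (doc.get("headline") or {}).get("main"),
        "abstract": doc.get("abstract"),
        "lead_paragraph": doc.get("lead_paragraph"),
        "word_count": doc.get("word_count"),
    }
```

Running `fetch_archive` shortly after each of the three daily updates and storing the result of `extract_fields` for every document gives one snapshot per article per update, which is all that is needed to spot changes.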
article info:
- article_id: 2730d085-4fe5-5930-b62b-9fd47d79f760
- pub_date: 2024-10-23 04:00:26
- section_name: World
- document_type: article
- web_uri: https://www.nytimes.com/2024/10/23/world/middleeast/mideast-war-israel-iran-attack.html
history:
2024-10-23 11:45:08: word count changed to 1094 words.
2024-10-23 11:45:08: abstract changed.
2024-10-23 11:45:08: lead paragraph changed.
2024-10-23 19:45:10: word count changed to 1097 words.
2024-10-23 19:45:10: abstract changed.
2024-10-23 19:45:10: lead paragraph changed.
2024-10-24 03:45:10: word count changed to 1091 words.
2024-10-24 03:45:10: abstract changed.
2024-10-24 03:45:10: lead paragraph changed.
2024-10-25 19:45:13: word count changed to 1101 words.
2024-10-25 19:45:13: abstract changed.
2024-10-25 19:45:13: lead paragraph changed.
2024-10-26 03:45:09: word count changed to 1104 words.
2024-10-26 03:45:09: abstract changed.
2024-10-26 03:45:09: lead paragraph changed.
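History entries like the ones above can be produced by comparing two consecutive snapshots of the same article. A minimal sketch, assuming each snapshot is a dict with `word_count`, `headline`, `abstract`, and `lead_paragraph` keys; the key names, the `describe_changes` helper, and the headline message are illustrative, with the wording modeled on the history lines:

```python
# Fields to compare, in the order they should be reported.
WATCHED = [
    ("word_count", "word count"),
    ("headline", "headline"),
    ("abstract", "abstract"),
    ("lead_paragraph", "lead paragraph"),
]


def describe_changes(old: dict, new: dict, seen_at: str) -> list:
    """Return one history line per watched field that differs between snapshots."""
    lines = []
    for key, label in WATCHED:
        if old.get(key) == new.get(key):
            continue  # unchanged field, nothing to report
        if key == "word_count":
            # The word count is short enough to report the new value directly.
            lines.append(f"{seen_at}: word count changed to {new.get(key)} words.")
        else:
            lines.append(f"{seen_at}: {label} changed.")
    return lines
```

Only the fact of a change is logged for text fields here; storing the old and new text alongside each line would make it possible to show a diff later.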
search archives for older versions:
check archive.today for copies of this article.
check the archive.org Wayback Machine for copies of this article.
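A Wayback Machine lookup link can be built from an article URL and, optionally, a timestamp from the history. This sketch relies on the standard `web.archive.org/web/<timestamp>/<url>` address format, which redirects to the capture closest to the given time; `wayback_url` is a hypothetical helper. Wayback timestamps are in UTC, so a time taken from the history may need converting first (the history times look like Pacific Time, but that is an assumption). archive.today is not covered here.

```python
from datetime import datetime
from typing import Optional


def wayback_url(article_url: str, when: Optional[datetime] = None) -> str:
    """Build a Wayback Machine link for an article (hypothetical helper).

    With no timestamp, link to the calendar of all captures.
    With a timestamp (expected in UTC), link to the capture nearest that time.
    """
    if when is None:
        return f"https://web.archive.org/web/*/{article_url}"
    # Wayback timestamps use the compact form YYYYMMDDhhmmss.
    return f"https://web.archive.org/web/{when:%Y%m%d%H%M%S}/{article_url}"
```

For example, passing the time of a logged change gives a link to whichever capture the Wayback Machine holds nearest to that edit, if it holds any.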