add script to automatically update source first_story #12

Open
rahulbot opened this issue Jan 14, 2025 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

@rahulbot

Our sources have a first_story field on them. We should fill it in regularly by searching for the oldest (reasonable) date on each source. It's smart to re-run this every 6 months or so because we sometimes ingest historical data. It should be easy to implement if we can run a query for each source covering dates from 2000 until now, sorted by pub_date ASC, and take the first result. Perhaps we should skip sources that have fewer than 100 articles, because that's an indicator that we don't get regular data from them? It's a simple data science script, so I'm logging it here, but it's probably best implemented as a Django management command like update-stories-per-week.
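
A rough sketch of the core logic, in case it helps whoever picks this up. Everything named here is an assumption for illustration: the `Source` dataclass stands in for the real Django model, and `lookup` is a hypothetical callable wrapping whatever search query we end up using (it is assumed to return the total story count and the oldest pub_date for one source in a date range). Only the 2000 start date and the 100-article cutoff come from the description above.

```python
from dataclasses import dataclass
from datetime import date
from typing import Callable, Iterable, List, Optional, Tuple

MIN_ARTICLES = 100               # skip sources with fewer stories than this
SEARCH_START = date(2000, 1, 1)  # ignore anything dated before 2000


@dataclass
class Source:
    """Hypothetical stand-in for the real source model."""
    id: int
    name: str
    first_story: Optional[date] = None


# Hypothetical query wrapper: (source_id, start, end) -> (total, oldest pub_date).
# In practice this would run the search sorted by pub_date ASC and take the first hit.
OldestStoryLookup = Callable[[int, date, date], Tuple[int, Optional[date]]]


def update_first_story(
    sources: Iterable[Source],
    lookup: OldestStoryLookup,
    today: Optional[date] = None,
) -> List[Source]:
    """Set first_story on each qualifying source and return the ones that changed."""
    today = today or date.today()
    changed = []
    for source in sources:
        total, oldest = lookup(source.id, SEARCH_START, today)
        # Too few stories suggests we don't get regular data from this source.
        if total < MIN_ARTICLES or oldest is None:
            continue
        if source.first_story != oldest:
            source.first_story = oldest
            changed.append(source)
    return changed
```

In a management command, the loop body would presumably save each changed source instead of collecting it in a list; that part is left out because it depends on the actual model.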


2 participants