Timeline on twarc-archive.py
Hi,
Is it possible to use the timeline functionality on twarc-archive.py? It would seem that twarc-archive only uses GET search/tweets and not GET statuses/user_timeline. Is that correct?
My use case requires the timeline because it allows more requests per rate-limit window, and I have to run it over an extended period of time, so automatically mining only the newer tweets is exactly what I’m looking for.
Because, as far as I know, if I run
twarc timeline slarrain > slarrain.jsonl
two days in a row, it just overwrites the slarrain.jsonl file. And if I instead give it a new name, like slarrain2.jsonl, it will re-mine the tweets that are already in the old file. So it would seem that what I’m looking for is exactly the functionality that twarc-archive provides, not for search (which I’m currently using) but for timelines.
Is that possible?
In any case, thank you so much for twarc. It is beyond awesome.
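The incremental behaviour described above (append to the existing file and fetch only tweets newer than the ones already stored) could be sketched roughly as follows. This is a minimal illustration, not how twarc-archive.py is implemented: the helper names `newest_id` and `archive_timeline` are hypothetical, the `fetch` callable stands in for whatever actually calls the timeline endpoint, and each stored line is assumed to be a tweet JSON object with a numeric `id` field.

```python
import json
import os
from typing import Callable, Iterable, Optional


def newest_id(path: str) -> Optional[int]:
    """Return the largest tweet id already stored in a JSONL file, or None."""
    if not os.path.exists(path):
        return None
    newest = None
    with open(path, encoding="utf-8") as fh:
        for line in fh:
            line = line.strip()
            if not line:
                continue
            tweet_id = int(json.loads(line)["id"])
            if newest is None or tweet_id > newest:
                newest = tweet_id
    return newest


def archive_timeline(
    path: str,
    fetch: Callable[[Optional[int]], Iterable[dict]],
) -> int:
    """Append only tweets newer than those already in `path`.

    `fetch` is a hypothetical callable that receives the newest stored id
    (or None on the first run) and yields tweet dicts.
    Returns the number of tweets appended.
    """
    since_id = newest_id(path)
    written = 0
    # Open in append mode so earlier runs are never overwritten.
    with open(path, "a", encoding="utf-8") as fh:
        for tweet in fetch(since_id):
            # Guard against a fetcher that ignores since_id.
            if since_id is not None and int(tweet["id"]) <= since_id:
                continue
            fh.write(json.dumps(tweet) + "\n")
            written += 1
    return written
```

With twarc as a library, `fetch` might be something like `lambda since_id: t.timeline(screen_name="slarrain", since_id=since_id)`, assuming the installed version's `timeline` method accepts a `since_id` argument; that is worth checking against the twarc source before relying on it.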
Issue Analytics
- Created 5 years ago
- Comments: 10 (7 by maintainers)
Top GitHub Comments
You’re welcome! Thanks for the kind words, they make everything worthwhile.
Thank you @jason-lange – it’s always great to hear people find it useful, even if it’s a bit difficult at first. Thanks also for the pointer to @alblaine’s tutorial, it had slipped my radar and points to a possible way to improve the twarc documentation.