Academic Search results truncated?
See original GitHub issueI’m noticing that I can only get a months worth of results back from the /2/tweets/search/all
endpoint. But I can get more as long as I use --end-time
.
For example searching for from:barackobama
yields tweets back to 2021-03-17 and then twarc stops:
twarc2 search from:barackobama --archive --flatten | jq .created_at
...
"2021-03-19T17:01:07.000Z"
"2021-03-18T13:00:08.000Z"
"2021-03-17T14:23:01.000Z"
"2021-03-17T14:23:01.000Z"
"2021-03-17T14:23:01.000Z"
But if I ask for more tweets I can get another month:
twarc2 search from:barackobama --archive --end-time 2021-03-17 --flatten | jq .created_at
...
"2021-02-19T21:04:07.000Z"
"2021-02-18T22:53:52.000Z"
"2021-02-18T19:02:40.000Z"
"2021-02-17T21:15:27.000Z"
"2021-02-15T21:13:15.000Z"
I would expect both commands to keep retrieving tweets back to barackobama’s first tweets? I don’t know if this is a problem in twarc or this is some kind of limitation of the API. Any insights would be welcome.
Issue Analytics
- State:
- Created 2 years ago
- Comments:6 (6 by maintainers)
Top Results From Across the Web
Search Tips: Truncation and Boolean Searching: Home
To truncate a search term, do a keyword search in a database, but remove the ending of the word and add an asterisk...
Read more >Why do truncation (*) searches sometimes return fewer results?
Truncation searches using an asterisk (*) allow the EBSCO search engine to expand the query into multiple possible keywords.
Read more >Truncation - Search the Literature: Tips and Tricks
Truncation is a technique frequently used in keyword searching, in which it helps reduce the number of variations you have to search on ......
Read more >Truncation - Getting the Most out of PubMed Medline
Truncation is a search method in which symbols are used in place of letters or words to help you broaden your search.
Read more >Truncation - Database Search Tips - LibGuides at MIT Libraries
Truncation: Truncation, also called stemming, is a technique that broadens your search to include various word endings and spellings.
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Yes, I think it’s a legit worry. But depending on your query you could max it out searching a month window too. I guess I was hoping people would get in the habit of using
--limit
to control how much data they get at once?I learned over in the forum post that it is documented that the the
start_time
has this default:https://developer.twitter.com/en/docs/twitter-api/tweets/search/api-reference/get-tweets-search-all
Unless there are objections I’m going to update twarc to default start_time to Twitter Epoch Time time (2006-03-21 00:00:00) to avoid confusion from users.