Replies: 3 comments 4 replies
-
@mlkorra Do you have the field descriptions as well? I couldn't find them in the Twint docs.
-
I wanted to clarify the output for some fields (like mentions, urls, etc.).
-
Based on a discussion with the CIS folks, we've decided to scrape all data fields other than cashtags for descriptive research.
-
The following are the tweet fields fetched by the Twitter scraper (twint):
{"id":,
"conversation_id":,
"created_at":,
"date":,
"time":,
"timezone":,
"user_id":,
"username":,
"name":,
"place":,
"tweet":,
"language":,
"mentions":,
"urls":,
"photos":,
"replies_count":,
"retweets_count":,
"likes_count":,
"hashtags":,
"cashtags":,
"link": "",
"retweet":,
"quote_url":,
"video":,
"thumbnail":,
"near":,
"geo":,
"source":,
"user_rt_id":,
"user_rt":,
"retweet_id":,
"reply_to":,
"retweet_date":,
"translate":,
"trans_src":,
"trans_dest":}
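A minimal sketch of how the field selection discussed in the comments (everything except cashtags) could be applied, assuming each scraped tweet is available as a plain Python dict keyed by the names above. `TWEET_FIELDS`, `EXCLUDED_FIELDS`, and `select_fields` are hypothetical names used for illustration; they are not part of twint.

```python
# Field names copied from the list above; the values are whatever the scraper returns.
TWEET_FIELDS = (
    "id", "conversation_id", "created_at", "date", "time", "timezone",
    "user_id", "username", "name", "place", "tweet", "language",
    "mentions", "urls", "photos", "replies_count", "retweets_count",
    "likes_count", "hashtags", "cashtags", "link", "retweet", "quote_url",
    "video", "thumbnail", "near", "geo", "source", "user_rt_id", "user_rt",
    "retweet_id", "reply_to", "retweet_date", "translate", "trans_src",
    "trans_dest",
)

# Assumption: only cashtags is excluded, as decided in the comments.
EXCLUDED_FIELDS = {"cashtags"}


def select_fields(record, excluded=EXCLUDED_FIELDS):
    """Return a new dict with the listed fields only, minus the excluded ones.

    Fields missing from `record` are kept with a value of None so that
    every stored row has the same set of keys.
    """
    return {
        name: record.get(name)
        for name in TWEET_FIELDS
        if name not in excluded
    }
```

Filling missing fields with `None` is one possible design choice; dropping them instead would work equally well if the database tolerates sparse documents.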
Note: Fields in bold are currently stored in the database.
Additional fields in the database:
"timestamp_of_scraping",
"type" -> whether the tweet is fetched using keyword, hashtag or userhandle,
"search" -> search term used to scrape,
"content_type" -> whether the tweet contains text, image, gif, video,
"s3_url" -> URL of the media uploaded to the S3 bucket
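As a rough illustration of how these additional fields might be attached to a scraped tweet before it is stored, here is a sketch that assumes the tweet is a plain Python dict. The `annotate` and `infer_content_type` helpers are hypothetical, and the content-type heuristic (based on the `video` and `photos` fields) is an assumption rather than the project's actual logic; it does not attempt to distinguish GIFs.

```python
from datetime import datetime, timezone

# Allowed values for "type", taken from the description above.
SEARCH_TYPES = {"keyword", "hashtag", "userhandle"}


def infer_content_type(record):
    """Guess the content type from the scraped fields (assumed heuristic)."""
    if record.get("video"):
        return "video"
    if record.get("photos"):
        return "image"
    return "text"


def annotate(record, search_type, search_term, s3_url=None):
    """Return a copy of `record` with the additional database fields set."""
    if search_type not in SEARCH_TYPES:
        raise ValueError(f"unknown search type: {search_type!r}")
    enriched = dict(record)  # leave the caller's dict untouched
    enriched["timestamp_of_scraping"] = datetime.now(timezone.utc).isoformat()
    enriched["type"] = search_type
    enriched["search"] = search_term
    enriched["content_type"] = infer_content_type(record)
    enriched["s3_url"] = s3_url  # None until the media has been uploaded
    return enriched
```

For example, `annotate(tweet, "hashtag", "#example")` would return the tweet with all five extra keys filled in, leaving `s3_url` empty until an upload step supplies it.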
Please go through the data fields and mention or discuss whether any of them are helpful for the task.