twitter

This repo contains Jupyter notebooks for collecting, cleaning, and classifying Twitter data. It is organized as follows:

(1) Imports:

Extract subsamples from a historical dataset of tweets based on specific criteria

(2) Locations:

Geocode Twitter users' account locations so that Twitter data can be mapped to official statistics
Reverse geocode tweets with geocoordinates
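
To illustrate the reverse-geocoding step, here is a minimal sketch that maps a tweet's coordinates to the nearest entry in a small reference table. The `PLACES` table and the function names are hypothetical and are not taken from the notebooks, which may rely on a geocoding service or a full gazetteer instead.

```python
import math

# Hypothetical reference points (name, latitude, longitude).
# A real gazetteer would be far larger; this table is an assumption.
PLACES = [
    ("New York", 40.7128, -74.0060),
    ("Los Angeles", 34.0522, -118.2437),
    ("Chicago", 41.8781, -87.6298),
]


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometers between two points."""
    radius_km = 6371.0
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = math.radians(lat2 - lat1)
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * radius_km * math.asin(math.sqrt(a))


def reverse_geocode(lat, lon, places=PLACES):
    """Return the name of the reference place closest to (lat, lon)."""
    return min(places, key=lambda p: haversine_km(lat, lon, p[1], p[2]))[0]


# Example: a tweet geotagged in lower Manhattan.
print(reverse_geocode(40.73, -73.99))
```

A nearest-point lookup is the simplest possible approach; matching coordinates against administrative boundary polygons would be more accurate when the goal is to join tweets to official statistics.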

(3) Users:

Extract the list of users whose account location was properly geocoded
Look up located users' profiles using the Twitter API to identify users whose accounts are not protected
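
The profile lookup step can be sketched as follows, assuming the Twitter API v1.1 user object, which carries a boolean `protected` field, and the 100-IDs-per-request limit of its `users/lookup` endpoint. The helper names and the sample profiles are hypothetical, and the network call itself is left out.

```python
def chunk(ids, size=100):
    """Yield successive batches of at most `size` user IDs.

    The default of 100 assumes the per-request limit of the
    v1.1 users/lookup endpoint.
    """
    for i in range(0, len(ids), size):
        yield ids[i:i + size]


def unprotected_ids(profiles):
    """Return the IDs of profiles that are not protected.

    Each profile is assumed to be a dict shaped like a v1.1 user
    object, with an `id` and a boolean `protected` field.
    """
    return [p["id"] for p in profiles if not p.get("protected", False)]


# Hypothetical lookup results, for illustration only.
sample_profiles = [
    {"id": 1, "protected": False},
    {"id": 2, "protected": True},
    {"id": 3, "protected": False},
]
print(unprotected_ids(sample_profiles))
```

Only unprotected accounts expose their timelines, which is why this filter comes before the timeline step.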

(4) Timelines:

(5) Tweets:

(6) Mentions:

(7) Classification:

Compute the similarity of each tweet to a given sentence in order to find semantically similar tweets
Feed a sample of tweets into a Qualtrics survey to collect labels on Amazon Mechanical Turk
Classify tweets based on the labor market status of Twitter users
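
The similarity step can be sketched with cosine similarity over word counts. This is a stand-in chosen to keep the example self-contained: it captures lexical overlap only, whereas finding semantically similar tweets would typically call for sentence embeddings, and the notebooks may do exactly that. All function names here are hypothetical.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z0-9']+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between the word-count vectors of two texts."""
    va, vb = Counter(tokenize(a)), Counter(tokenize(b))
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm = (math.sqrt(sum(c * c for c in va.values()))
            * math.sqrt(sum(c * c for c in vb.values())))
    return dot / norm if norm else 0.0


def rank_by_similarity(tweets, sentence):
    """Sort tweets from most to least similar to the given sentence."""
    return sorted(tweets,
                  key=lambda t: cosine_similarity(t, sentence),
                  reverse=True)


# Hypothetical tweets, for illustration only.
tweets = ["great weather today", "i just lost my job"]
print(rank_by_similarity(tweets, "i lost my job")[0])
```

Ranking tweets this way gives a shortlist of candidates to send for labeling, which is more efficient than sampling at random when the target class is rare.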

