Schema for Imported Tweets

The schema for the imported tweets is shown in the following table:
Column Name
Column Type
Length
Description
author
Character
128
The author’s screen name.
authordescription
Character
1024
The author’s description.
authorfavouritecount
Numeric
8
The number of tweets the author has indicated as a “favorite”.
authorfollowercount
Numeric
8
The number of followers the author had when the tweet was imported from Twitter.
authorfriendcount
Numeric
8
The number of users the author was following when the tweet was imported from Twitter.
authorid
Numeric
8
Twitter’s unique ID for the author.
authorimageurl
Character
1024
A link to the author’s profile image.
authorlang
Character
2
The BCP 47 code for the author’s self-declared user interface language.
authorlocation
Character
128
The author’s self-declared location.
authorname
Character
128
The author’s self-declared name in the author’s profile.
authortimezone
Character
256
The author’s self-declared time zone.
authorurl
Character
1024
A URL provided by the author in the author’s profile.
body
Character
1024
The body of the tweet.
deviceinfo
Character
1024
The utility that was used to post the tweet. It is represented as an HTML-formatted string.
docid
Numeric
8
Twitter’s unique ID for the tweet. You can specify this variable as a unique row identifier when working with text analysis and word cloud visualizations.
doclatitude
Numeric
8
The latitude value of the tweet’s coordinates (if available). If it is not available, then a missing value is populated.
doclongitude
Numeric
8
The longitude value of the tweet’s coordinates (if available). If it is not available, then a missing value is populated.
isretweet
Numeric
8
A value of 1 indicates that the tweet is a retweet. Otherwise, the value is 0.
referenceauthor
Character
128
Screen name of the user to whom this tweet was a reply. If the tweet is not a reply, then this field is empty.
referenceauthorid
Numeric
8
User ID of the user to whom this tweet was a reply. If the tweet is not a reply, then the value for this field is set to -1.
publisheddatetime
Numeric
8
The tweet’s published date and time as a SAS datetime value (based on the number of seconds since January 1, 1960 at midnight).
publisheddatetimestr
Character
34
The tweet’s published date and time in string format. Example: October 24, 2013 6:56:25 PM GMT.
tags
Character
150
A semi-colon separated list of hash tags that are mentioned within the tweet.
listoflinks
Character
1024
A semi-colon separated list of URLs that are included in the body of the tweet. Expanded links are used where available.
mentionedusernames
Character
256
A semi-colon separated list of names of twitter users mentioned within the tweet.
mentionedusers
Character
256
A semi-colon separated list of screen names of twitter users mentioned within the tweet.
retweetcount
Numeric
8
The number of times this tweet has been retweeted when the tweet was imported from Twitter.
The schema for tweets is different from the schema that Twitter uses for tweets, users, and entities. The following list provides URLs to the information from Twitter for comparison purposes:
Last updated: January 8, 2019