Column Name
|
Column Type
|
Length
|
Description
|
---|---|---|---|
author |
Character
|
128
|
The author’s
screen name.
|
authordescription |
Character
|
1024
|
The author’s
description.
|
authorfavouritecount |
Numeric
|
8
|
The number of tweets
the author has indicated as a “favorite”.
|
authorfollowercount |
Numeric
|
8
|
The number of followers the author had when the tweet was imported from Twitter.
|
authorfriendcount |
Numeric
|
8
|
The number of users the author was following when the tweet was imported from Twitter.
|
authorid |
Numeric
|
8
|
Twitter’s unique
ID for the author.
|
authorimageurl |
Character
|
1024
|
A link to the author’s
profile image.
|
authorlang |
Character
|
2
|
The BCP 47 code for the author’s self-declared user interface language.
|
authorlocation |
Character
|
128
|
The author’s
self-declared location.
|
authorname |
Character
|
128
|
The author’s
self-declared name in the author’s profile.
|
authortimezone |
Character
|
256
|
The author’s self-declared time zone.
|
authorurl |
Character
|
1024
|
A URL provided by the
author in the author’s profile.
|
body |
Character
|
1024
|
The body of the tweet.
|
deviceinfo |
Character
|
1024
|
The utility that was used to post the tweet. It is represented as an HTML-formatted
string.
|
docid |
Numeric
|
8
|
Twitter’s unique ID for the tweet. You can specify this variable as a unique row identifier when working with text analysis and word cloud visualizations.
|
doclatitude |
Numeric
|
8
|
The latitude value of the tweet’s coordinates (if available). If it is not available,
then a missing value is populated.
|
doclongitude |
Numeric
|
8
|
The longitude value of the tweet’s coordinates (if available). If it is not available,
then a missing value is populated.
|
isretweet |
Numeric
|
8
|
A value of 1 indicates that the tweet is a retweet. Otherwise, the value is 0.
|
referenceauthor |
Character
|
128
|
Screen name of the user to whom this tweet was a reply. If the tweet is not a reply,
then this field is empty.
|
referenceauthorid |
Numeric
|
8
|
User ID of the user to whom this tweet was a reply. If the tweet is not a reply, then
the value for this field is set to
-1.
|
publisheddatetime |
Numeric
|
8
|
The tweet’s published date and time as a SAS datetime value (based on the number of seconds since January 1, 1960 at midnight).
|
publisheddatetimestr |
Character
|
34
|
The tweet’s published date and time in string format. Example:
October
24, 2013 6:56:25 PM GMT .
|
tags |
Character
|
150
|
A semi-colon separated list of hash tags that are mentioned within the tweet.
|
listoflinks |
Character
|
1024
|
A semi-colon separated list of URLs that are included in the body of the tweet. Expanded
links are used where available.
|
mentionedusernames |
Character
|
256
|
A semi-colon separated list of names of twitter users mentioned within the tweet.
|
mentionedusers |
Character
|
256
|
A semi-colon separated list of screen names of twitter users mentioned within the
tweet.
|
retweetcount |
Numeric
|
8
|
The number of times this tweet has been retweeted when the tweet was imported from
Twitter.
|