| Column Name | Column Type | Length | Description |
|---|---|---|---|
| author | Character | 128 | The author’s screen name. |
| authordescription | Character | 1024 | The author’s description. |
| authorfavouritecount | Numeric | 8 | The number of tweets the author has “favorited.” |
| authorfollowercount | Numeric | 8 | The number of followers the author had when the tweet was imported from Twitter. |
| authorfriendcount | Numeric | 8 | The number of users the author was following when the tweet was imported from Twitter. |
| authorid | Numeric | 8 | Twitter’s unique ID for the author. |
| authorimageurl | Character | 1024 | A link to the author’s profile image. |
| authorlang | Character | 2 | The BCP 47 code for the author’s self-declared user interface language. |
| authorlocation | Character | 128 | The author’s self-declared location. |
| authorname | Character | 128 | The author’s self-declared name in the author’s profile. |
| authortimezone | Character | 256 | The author’s self-declared time zone. |
| authorurl | Character | 1024 | A URL provided by the author in the author’s profile. |
| body | Character | 1024 | The body of the tweet. |
| deviceinfo | Character | 1024 | The utility that was used to post the tweet, represented as an HTML-formatted string. |
| docid | Numeric | 8 | Twitter’s unique ID for the tweet. You can specify this variable as a unique row identifier when working with text analysis and word cloud visualizations. |
| doclatitude | Numeric | 8 | The latitude of the tweet’s coordinates, if available. Otherwise, the value is missing. |
| doclongitude | Numeric | 8 | The longitude of the tweet’s coordinates, if available. Otherwise, the value is missing. |
| isretweet | Numeric | 8 | A value of 1 indicates that the tweet is a retweet. Otherwise, the value is 0. |
| referenceauthor | Character | 128 | The screen name of the user to whom this tweet was a reply. If the tweet is not a reply, then this field is empty. |
| referenceauthorid | Numeric | 8 | The user ID of the user to whom this tweet was a reply. If the tweet is not a reply, then the value is -1. |
| publisheddatetime | Numeric | 8 | The tweet’s published date and time as a SAS datetime value (the number of seconds since January 1, 1960). |
| publisheddatetimestr | Character | 32 | The tweet’s published date and time in string format. Example: October 24, 2013 6:56:25 PM GMT. |
| tags | Character | 128 | A semicolon-separated list of hashtags that are mentioned within the tweet. |
| listoflinks | Character | 1024 | A semicolon-separated list of URLs that are included in the body of the tweet. Expanded links are used where available. |
| mentionedusernames | Character | 256 | A semicolon-separated list of names of Twitter users mentioned within the tweet. |
| mentionedusers | Character | 256 | A semicolon-separated list of screen names of Twitter users mentioned within the tweet. |
| retweetcount | Numeric | 8 | The number of times this tweet had been retweeted when the tweet was imported from Twitter. |
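
If the imported table is exported for processing outside SAS, a few of the columns above need decoding first. The following Python sketch is illustrative only: the helper names and the sample row are hypothetical, and it assumes that `publisheddatetime` is expressed in GMT, as the `publisheddatetimestr` example suggests. It converts the SAS datetime value, splits a semicolon-separated field, and maps the -1 value of `referenceauthorid` to `None`.

```python
from datetime import datetime, timedelta, timezone

# SAS datetime values count seconds from midnight, January 1, 1960.
# Assumption: the imported value is in GMT/UTC.
SAS_EPOCH = datetime(1960, 1, 1, tzinfo=timezone.utc)


def sas_datetime_to_utc(seconds):
    """Convert a SAS datetime value (seconds since 1960-01-01) to a datetime."""
    return SAS_EPOCH + timedelta(seconds=seconds)


def split_list(value):
    """Split a semicolon-separated field (tags, listoflinks, ...) into a list."""
    return [item.strip() for item in value.split(";") if item.strip()]


def reply_author_id(referenceauthorid):
    """Return the replied-to user ID, or None when the tweet is not a reply (-1)."""
    return None if referenceauthorid == -1 else int(referenceauthorid)


# Hypothetical row; the values are made up for illustration.
row = {
    "publisheddatetime": 1698260185,
    "tags": "sas; analytics",
    "referenceauthorid": -1,
}

published = sas_datetime_to_utc(row["publisheddatetime"])
tags = split_list(row["tags"])
reply_to = reply_author_id(row["referenceauthorid"])

print(published.isoformat())  # 2013-10-24T18:56:25+00:00
print(tags)                   # ['sas', 'analytics']
print(reply_to)               # None
```

The sample `publisheddatetime` value corresponds to the `publisheddatetimestr` example in the table (October 24, 2013 6:56:25 PM GMT). An empty character field yields an empty list rather than a list that contains one empty string.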