Data dictionary

This page lists the different pieces of data that may be used in queries or returned in the API response, along with a description of what they are.

Fields in API response

We provide the results of several calculations that summarise the social performance of a piece of content, or predict it.

FieldNameDescription
authorAuthorThe author of the content, when available. This field is available when searching for YouTube content.
authorsAuthorsA list of the author(s) of the content, when available.
avgAverageIn the stats endpoint, the average engagement for the posts that match your search query.
commentsFacebook CommentsThe number of comments made on posts relating to a piece of content.
channelChannelThe URL of the YouTube channel.
countCountIn the stats endpoint, the total number of posts that match your search query.
countryCountryThe human-readable country name for the publisher.
country_codeCountry CodeThe ISO 3166 country code that represents the country of the publisher.
created_atCreated AtThe date and time that an Instagram post was published. A UNIX timestamp in milliseconds.
delta_periodDelta PeriodTime gap between the latest 2 social-engagement-data collection point for a piece of content. This frequency could take place for a maximum of 7 days

If 7 days has been reached, we're only checking FB scores once every 7 days until the content reaches 30-days old
delta_period_unitDelta Period UnitThe value is always m, representing minutes. This is the unit for the delta_period.
delta_timeDelta TimeThe same as delta_period.
entitiesEntitiesThe proper nouns extracted from a piece of content.
excerptExcerptA short extract of the content, when available.
external_linkExternal LinkIf a Facebook post contains a link to an external piece of content (eg. a news article), this is that URL.
favourite_countTwitter Likes CountThe number of likes that the specified post has received. These used to be called ‘favourites’ in the Twitter world, and this field has retained that name to prevent making a breaking change to our API.
fb_dataFacebook DataThis is a collection of different data points that relate to Facebook.
fb_overperformingFacebook OverperformingA calculation of how a piece of content's performance on Facebook compares to its usual average performance.
fb_predicted_interactionsFacebook Predicted InteractionsThe total number of Facebook engagements that we predict a piece of content will achieve.
fb_sponsor_tagsFacebook Sponsor TagsSponsors of the Facebook post, if available.
fb_storyFacebook StoryA description generated by Facebook about an event. For example, Tom shared 3 photos or Jane shared a link.
fb_totalFacebook StatsFor the /stats endpoint, a grouping of statistics for your chosen aggregation that relate to Facebook engagement.
fb_total_interactionsFacebook Total InteractionsThe total number of engagements (reactions, shares, comments) a piece of content has had on Facebook. The naming of this field differs between the fbPosts endpoint (total_engagement_count) and the fbInfluencers endpoint (fb_total_interactions).
followers_countFacebook or Twitter Followers CountThe number of people following a given Facebook page or Twitter account.
following_countTwitter Following CountThe number of other Twitter accounts followed by a given Twitter account.
has_videoVideo FlagA flag to show whether we detected video in the content.
headlineHeadlineThe heading of the content.
idInstagram Unique IdentifierAs with uuid, this is an alphanumeric string that uniquely identifies a piece of Instagram content in the NewsWhip database.
ig_commentsInstagram CommentsThe number of comments that have been made on an Instagram post.
ig_filterInstagram FilterOn Instagram, if a filter was applied to the image, the name of that filter appears here.
ig_full_user_nameInstagram Full UsernameThe human-readable (eg. with spaces) name of an Instagram user.
ig_likesInstagram LikesThe number of likes that have been made on an Instagram post.
ig_linkInstagram LinkThe URL of an Instagram post.
ig_post_textInstagram Post TextThe written text associated with the original Instagram post.
ig_post_typeInstagram Post TypeThe type of Instagram post the content is. For example, image or carousel.
ig_profile_imageInstagram Profile ImageThe URL of the Instagram user's avatar.
ig_tagsInstagram TagsThe hashtags used in an Instagram post.
ig_user_nameInstagram UsernameThe handle of an Instagram user.
image_linkImage LinkIf we have detected an important image for the content, this is the link to it.
instagram_idInstagram IDA unique identifier for the Instagram post, provided by Instagram.
is_live_videoLive Video FlagA flag to show whether the content is a Facebook Live Video.
keywordsKeywordsA list of keywords sometimes provided by the publisher. These are provided verbatim, and so may not be consistent from publisher to publisher.
li_countLinkedIn CountThe number of engagements a piece of content has had on LinkedIn.

See Known issues for updates on the availability of LinkedIn data.
li_dataLinkedIn DataThis is a collection of different data points that relate to LinkedIn.
likes_countFacebook or Twitter Likes CountFor Facebook, the number of posts for which this user has left a like, love, wow, haha, sad, or angry reaction. For Twitter, the number of posts that this user has liked.
keyStats KeyIn the stats endpoint, each entry for the aggregation option that you have chosen.
likesFacebook LikesThe number of likes of posts relating to a piece of content.
linkLinkThe URL of the content.
linkedinLinkedIn StatsFor the /stats endpoint, a grouping of statistics for your chosen aggregation that relate to LinkedIn engagement.
maxMaximum EngagementIn the stats endpoint, the maximum number of engagements that any post that matches your search query had.
max_fb_interactionsMaximum Facebook InteractionsFor Facebook, the number of interactions on the user’s most influenced story.
max_nw_scoreMaximum VelocityThe maximum social velocity of a piece of content so far.

This is a score that we give to stories based on how fast they are spreading on Facebook, Twitter, and LinkedIn. The scores tell you which articles and videos are captivating the most attention today, in whatever niche or sector you’re interested in. The higher the score, the more engagement we are seeing the content getting.
max_retweet_valueMaximum RetweetsFor Twitter, the number of retweets on this user’s most influenced article.
minMinimum EngagementIn the stats endpoint, the minimum (lowest) number of engagements that any post that matches your search query had.
nw_max_scoreMaximum VelocityThe maximum social velocity of a piece of content so far.

This is a score that we give to stories based on how fast they are spreading on Facebook, Twitter, and LinkedIn. The scores tell you which articles and videos are captivating the most attention today, in whatever niche or sector you’re interested in. The higher the score, the more engagement we are seeing the content getting.
nw_scoreCurrent VelocityThe current social velocity of a piece of content.
page_idFacebook Page IDThe unique identifier for a Facebook page.
page_nameFacebook Page NameThe name of a Facebook page.
pi_countPinterest CountThe number of pins a piece of content has had on Pinterest.
pi_dataPinterest DataThis is a collection of different data points that relate to Pinterest.
pinterestPinterest StatsFor the /stats endpoint, a grouping of statistics for your chosen aggregation that relate to Pinterest engagement.
post_typeFacebook Post TypeThe type of Facebook post the content is. For example, a status.
predicted_interactionsPredicted InteractionsThe total number of Facebook engagements and Twitter Influencer Shares (combined) that we predict a piece of content will achieve.
predicted_timestampPredicted Interactions TimestampThe date and time that we calculate the Predicted Interactions number will be reached.
publication_timestampPublication TimestampThe date and time that the content was published. It is a UNIX timestamp in milliseconds.
publisherPublisherThe name, usually the domain, of the publisher of the content.
reactionsFacebook ReactionsA breakdown of the types of reactions that have been made on a post, and the number of each. Reactions could be: comments, likes, shares, loves, wows, hahas, sads, or angrys.
recent_fb_countsFacebook Total Count DeltaThis is the same as total_count_delta for Facebook. This is a legacy field that may be removed in a future update.
recent_tw_countsTwitter Total Count DeltaThis is the same as total_count_delta for Twitter. This is a legacy field that may be removed in a future update.
relatedStoriesRelated StoriesA list of stories similar to the parent story.

This works by looking for headlines with 4 or more words in common, excluding stop words (eg. and, or, but etc), within the query result. So, if you run a query that returns 3000 results, and within that you have several headlines that are similar, these will appear in the related stories section of the response.
rt_countTwitter Retweets CountThe number of posts this user has retweeted.
sharesFacebook SharesThe number of times posts relating to a piece of content were shared on Facebook.
sourceSourceThis is a collection of different data points that relate to the publisher.
statsStatsIn the stats endpoint, a group of social engagement metrics for a given aggregation.
statuses_countTwitter Posts CountThe number of Tweets the user has made.
std_deviationStandard DeviationIn the stats endpoint, the amount of variation of engagement of the posts that match your search query.
sumSumIn the stats endpoint, the sum total of all engagements for the posts that match your search query.
sum_of_squaresSum of SquaresThe sum, over all observations, of the squared differences of each observation from the overall mean. In the stats endpoint, this refers to the engagement of the posts that match your search query.
thumbnailThumbnailThe URL of the video thumbnail for YouTube content.
topicsTopics (sometimes referred to as Categories)The theme associated with the content's source, based on Topics.
totalStats Engagement TotalIn the stats endpoint, the total engagement across all available social networks the posts that match your search query.
total_matchTotal Search MatchesThe total number of matches identified for your query.
total_count_deltaTotal Count DeltaThe difference between the current number of engagement and the number at the time of our previous check.
total_engagement_countFacebook Total Engagement CountThe total number of engagements (reactions, shares, comments) a piece of content has had on Facebook.
tw_countTwitter CountThe number of Influencer Shares a piece of content has had on Twitter.
tw_dataTwitter DataThis is a collection of different data points that relate to Twitter.
tw_overperformingTwitter OverperformingA calculation of how a piece of content's performance on Twitter compares to its usual average performance.
tw_predicted_interactionsTwitter Predicted InteractionsThe total number of Twitter Influencer Shares that we predict a piece of content will achieve.
twitterTwitter StatsFor the /stats endpoint, a grouping of statistics for your chosen aggregation that relate to Twitter engagement.
uuidUnique IdentifierAn alphanumeric string that uniquely identifies a piece of content in the NewsWhip database.
varianceVarianceThe expectation of the squared deviation of a random variable from its mean. In the stats endpoint, this refers to the engagement of the posts that match your search query.
videosEmbedded VideosA list of URLs to videos embedded in a piece of content. For example, a YouTube video embedded in a blog post.
youtube_postYouTube PostThe written content that is associated with a YouTube video.
ytCommentsYouTube CommentsThe number of comments that have been made on a YouTube video.
ytLikesYouTube LikesThe number of times that people have liked (thumb up) a YouTube video.
ytTotalYouTube Engagement TotalThe sum of views, likes and comments for a YouTube video.
ytViewsYouTube ViewsThe number of views a YouTube video has.

Sorting options

This table lists common options that may be available for sorting data for different endpoints. It also provides a brief definition of what those sort options mean.

Note that not all endpoints support all sorting options. Check the documentation for the endpoint you are using to be sure.

FieldNameDescription
created_atCreated AtThe published date of the matched article
fb_total_engagementFacebook Total Engagement
fb_tw_overperformingFacebook and Twitter OverperformingThe sum of Facebook and Twitter Overperforming scores
fb_tw_and_liFacebook, Twitter, and LinkedInThe sum of Facebook, Twitter, and LinkedIn engagements
fb_tw_li_and_piFacebook, Twitter, LinkedIn, and PinterestThe sum of Facebook, Twitter, LinkedIn, and Pinterest engagements
linkedinLinkedInThe equivalent of li_count
nw_max_scoreNewsWhip Maximum ScoreThe highest velocity the content has achieved so far.
nw_scoreNewsWhip ScoreThe current velocity of the content.
predicted_interactionsPredicted InteractionsThe sum of the number of interactions we predict that the content will have on Facebook and Twitter combined.
twitterTwitterThe equivalent of tw_count

v1/stats aggregation options

In the v1/stats API it is required that you aggregate the results on a field of your choosing. The definitions of the available fields are listed below.

FieldNameDescription
authorsAuthorsThe person who produced a piece of content.
categoriesCategoriesThe topic of the content, as defined in Topics.
country_codeCountry CodeThe country associated with the publication. A two letter country code, as defined in the ISO 3166 standard.
domainDomainSee below.
domainsDomainsSee below.
languageLanguageThe language associated with the publication. A two letter code, as described in Supported Languages.
publisherPublisherSee below.

For domain, domains, and publisher an understanding of our data gathering methodology is needed to make a decision on which may be appropriate. Here's a walkthrough of how we populate these fields:

  • Let’s suppose we decide to start tracking content for uk.finance.yahoo.com. We add that to our database.
  • We set the publisher to yahoo.com by stripping out the subdomains.
  • We set the domain to uk.finance.yahoo.com because that’s what we decided to track in the first place.
  • We set the domains to yahoo.com, finance.yahoo.com, and uk.finance.yahoo.com, as these are all the possible combinations of the domain and subdomain.

So, when you choose to aggregate on one of these three, you’re choosing from one of those methodologies.