Five Things You Didn't Know DataSift Can Do
![Page 1: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/1.jpg)
Brad Hubbard, Product Manager, Developer Relations, DataSift
Five Things You Didn’t Know DataSift Can Do
#DSwebinar
![Page 2: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/2.jpg)
HUMAN DATA INTELLIGENCE
FILTER • TAG • ENRICH • STORE
Stream products will be covered today.
To see PYLON (our aggregated, anonymized Facebook topic data), join our next live demo:
http://lp.datasift.com/20150701-Live-SE-Demo-Registration
DataSift is of Two Minds: Indexed Data & Streaming
VEDO
![Page 3: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/3.jpg)
Launched: 2011
1K customers across 40 countries
4 global offices:
• San Francisco
• New York
• London
• Reading, UK
2B items processed per day
(These don’t count toward the 5 things)
![Page 4: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/4.jpg)
Brave New Data World
of all digital data created by consumers
emails a day
of US adults’ location is known
increase in global data by 2020
Word cloud: Thoughts • Emotions • Likes • Dislikes • Intentions • Ideas • Current Events • GEO • Occupation • Age • Topics • Gender
![Page 5: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/5.jpg)
Sources of Human-Generated Data
SOCIAL NETWORKS • BLOGS & NEWS • INSIDE YOUR BUSINESS
![Page 6: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/6.jpg)
The Complexity of Human Data
VOLUME • VARIETY • VELOCITY
Billions of users
Noisy
Generated in real time
per second
Post vs blog vs like
Terabytes per day
Ambiguous
Big spikes
Unstructured
![Page 7: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/7.jpg)
Turn Human Data into Meaning
![Page 8: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/8.jpg)
Unify Human Data
![Page 9: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/9.jpg)
We apply structure to the chaotic world of human data
![Page 10: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/10.jpg)
Unifying data from across the web
Tencent Weibo • Sina Weibo • Google+ • YouTube • Instagram • LexisNexis • Wikipedia • WordPress • Tumblr • Intense Debate • Disqus • NewsCred • Topix • Jive • Twitter • EDGAR News • Video • IMDb • Yammer
![Page 11: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/11.jpg)
Filtering Human Data with CSDL
![Page 12: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/12.jpg)
Filter: CSDL Data Processing Language
![Page 13: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/13.jpg)
WRITE ONCE • USE MANY
Write filters against generic objects, or get source-specific
![Page 14: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/14.jpg)
INFINITE COMPLEXITY
Rules can contain millions of tag and filter criteria, so there is no need to limit yourself
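With criteria lists this large, a filter is usually generated from data rather than typed by hand. The sketch below is one way that could look in Python. It only builds CSDL text and never calls DataSift. The helper name `contains_any` is hypothetical, and the targets `interaction.content` (generic) and `twitter.text` (source-specific) and the `contains_any` operator are written from memory of the CSDL documentation, so treat them as assumptions to check.

```python
def contains_any(target, keywords):
    """Build one CSDL-style `contains_any` condition from a keyword list."""
    for keyword in keywords:
        # Quotes and commas would need escaping; this sketch simply rejects them.
        if '"' in keyword or "," in keyword:
            raise ValueError("unsupported character in keyword: " + repr(keyword))
    return target + ' contains_any "' + ", ".join(keywords) + '"'


keywords = ["coffee", "espresso", "latte"]

# Generic target (assumed name): one rule applied to every source.
generic = contains_any("interaction.content", keywords)

# Source-specific target (assumed name): the same list, Twitter text only.
twitter_only = contains_any("twitter.text", keywords)

print(generic)
print(twitter_only)
```

The same keyword list feeds both rules, which is the "write once, use many" idea: swapping the target changes the scope without touching the criteria.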
![Page 15: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/15.jpg)
Enrich Human Data
![Page 16: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/16.jpg)
LINKS AUGMENTATION
Identifies links in social posts and fetches header data, allowing you to filter against link content
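To make the idea concrete, here is a minimal Python sketch of link augmentation as a concept, not DataSift's implementation. It finds URLs in a post, attaches a title for each, and then filters on that title. The field names (`content`, `links`, `title`) and the `fetch_title` callback are hypothetical; a dictionary lookup stands in for the real network fetch so the example runs offline.

```python
import re

URL_RE = re.compile(r"https?://\S+")


def augment_links(post, fetch_title):
    """Return a copy of the post with a `links` list of url/title pairs."""
    urls = URL_RE.findall(post["content"])
    enriched = dict(post)
    enriched["links"] = [{"url": url, "title": fetch_title(url)} for url in urls]
    return enriched


def matches_link_title(post, keyword):
    """True if any attached link title contains the keyword (case-insensitive)."""
    return any(
        keyword.lower() in (link["title"] or "").lower()
        for link in post.get("links", [])
    )


# Stand-in for fetching page headers over the network.
fake_titles = {
    "https://example.com/a": "Ten espresso tips",
    "https://example.org/b": "Weather report",
}

post = {"content": "Great read https://example.com/a and https://example.org/b"}
enriched = augment_links(post, fake_titles.get)
print(matches_link_title(enriched, "espresso"))
```

The post text never mentions espresso, yet the post still matches, because the filter runs against the linked page's title.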
![Page 17: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/17.jpg)
LANGUAGE DETECTION
Write filters on a per-language basis, or limit yourself to only certain languages
![Page 18: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/18.jpg)
Location either disclosed by user or listed in profile
GENDER DETECTION USING PROFILES AND NAME + LANGUAGE
![Page 19: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/19.jpg)
SENTIMENT AND TOPICS
Likely positive • Neutral • Likely negative
Topic detection (looking for nouns and disambiguating them)
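If sentiment arrives as a signed number, the three labels on this slide can be derived with a simple banding function. The Python sketch below assumes an integer score where positive means favorable, and a neutral band of ±2. Both the score scale and the band width are illustrative assumptions, not values taken from DataSift.

```python
def sentiment_label(score, neutral_band=2):
    """Map a signed sentiment score to one of the three slide labels.

    `neutral_band` is an assumed threshold: scores within +/- this value
    are treated as neutral.
    """
    if score > neutral_band:
        return "Likely positive"
    if score < -neutral_band:
        return "Likely negative"
    return "Neutral"


for score in (7, 1, -5):
    print(score, sentiment_label(score))
```

The word "Likely" is worth keeping in any downstream label: a band around zero acknowledges that weak scores are not reliable evidence either way.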
![Page 20: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/20.jpg)
Categorization, Scoring and Tagging
![Page 21: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/21.jpg)
VEDO enables automatic classification of Human Data based on its meaning
Apply Data Science
![Page 22: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/22.jpg)
OFF THE SHELF CLASSIFIERS
Enable automatic scoring and classification
![Page 23: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/23.jpg)
CUSTOM TAXONOMIES
Hierarchical rules to match your business
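A hierarchical taxonomy can be pictured as dotted tag paths, each with its own matching rule. The Python sketch below is a conceptual illustration only and does not use VEDO or CSDL. The taxonomy, the tag paths, and the substring matching are all hypothetical choices made for the example.

```python
# Hypothetical taxonomy: dotted path -> keywords that trigger the tag.
TAXONOMY = {
    "product.coffee": ["espresso", "latte"],
    "product.tea": ["green tea", "chai"],
    "service.delivery": ["courier", "delivery"],
}


def tag_text(text):
    """Return every taxonomy path whose keywords appear in the text."""
    lowered = text.lower()
    # Plain substring matching keeps the sketch short; it will over-match
    # (for example "chai" inside "chair"), so real rules need word boundaries.
    return sorted(
        path
        for path, keywords in TAXONOMY.items()
        if any(keyword in lowered for keyword in keywords)
    )


def top_level(tags):
    """Roll dotted tags up to their first level, e.g. product.coffee -> product."""
    return sorted({tag.split(".")[0] for tag in tags})


tags = tag_text("My latte arrived cold, the delivery took an hour")
print(tags)
print(top_level(tags))
```

Because the paths are hierarchical, the same tagged post can be counted at the leaf level (coffee versus tea) or rolled up to the branch level (product versus service).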
![Page 24: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/24.jpg)
CUSTOM SCORING SYSTEM
To expose meaning hidden deep within unstructured, text-rich data
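One simple form a custom scoring system could take is a weighted word list, where each matching term adds to or subtracts from a running score. The Python sketch below assumes that model. The weights and the `score_text` helper are invented for illustration and are not part of any DataSift product.

```python
import re

# Hypothetical weights: positive values raise the score, negative values lower it.
WEIGHTS = {"love": 2.0, "great": 1.0, "broken": -2.0, "refund": -1.5}


def score_text(text):
    """Sum the weights of every known word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    return sum(WEIGHTS.get(word, 0.0) for word in words)


print(score_text("Great phone, love it"))
print(score_text("Screen is broken, I want a refund"))
```

Unknown words contribute zero, so the score reflects only the terms the business has decided matter.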
![Page 25: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/25.jpg)
Delivery
Use Everywhere
![Page 26: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/26.jpg)
CONSUME A JSON STREAM DIRECTLY
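Consuming a JSON stream usually means reading one object at a time as it arrives. The Python sketch below assumes newline-delimited JSON, with blank lines possible between objects, and reads from an in-memory stand-in instead of a live connection. The `interaction.content` field layout is an assumption made for the example.

```python
import io
import json


def iter_interactions(stream):
    """Yield one parsed JSON object per non-empty line of a text stream."""
    for line in stream:
        line = line.strip()
        if not line:
            continue  # skip blank lines between objects
        yield json.loads(line)


# In-memory stand-in for a network stream.
sample = io.StringIO(
    '{"interaction": {"content": "hello"}}\n'
    "\n"
    '{"interaction": {"content": "world"}}\n'
)

contents = [item["interaction"]["content"] for item in iter_interactions(sample)]
print(contents)
```

A generator keeps memory use flat however long the stream runs, since each object is handed to the caller and then discarded.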
![Page 27: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/27.jpg)
Send your data to any of these pre-built connectors
![Page 28: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/28.jpg)
We handle the infrastructure and send you the data you need
![Page 29: Five Things You Didn't Know DataSift Can Do](https://reader036.vdocuments.us/reader036/viewer/2022062514/55bad40fbb61ebd9168b4605/html5/thumbnails/29.jpg)
THANK YOU