crowdsourcing speech data science and ai

18
Daniela Braga, PhD CEO [email protected] DefinedCrowd: Crowdsourcing, Speech Data Science, AI Crowdsourcing Week, June 15 th 2017

Upload: crowdsourcing-week

Post on 22-Jan-2018

231 views

Category:

Data & Analytics


2 download

TRANSCRIPT

Daniela Braga, PhDCEO

[email protected]

DefinedCrowd: Crowdsourcing,

Speech Data Science, AI

Crowdsourcing Week, June 15th 2017

definedcrowd confidential 3

Reason #1: machines need high quality data to learn

definedcrowd confidential 4

definedcrowd confidential 5

definedcrowd confidential 6

definedcrowd confidential 7

Reason #2: big data opportunity

definedcrowd confidential 8

Reason #2: big data

definedcrowd confidential 9

Reason #3: paradigm shift when teaching machines

definedcrowd confidential 10

definedcrowd confidential 12

definedcrowd confidential 13

definedcrowd confidential

DEMO

definedcrowd confidential 15

The challenges of crowdsourcing NLP data

Crowd quality Data quality

• Language tests• Job specific tests• Real Time Audits• Built-in language/spam

validators

• Referral system• System of tokens• Legal/privacy compliance

(under NDA)

Quality gateways

Controlled crowd

• Checking for suspicious crowd behavior (multiple accounts creation, peaks of activity, specific job spam, IP check against country of living)

Machine Learning

Data quality control

• Validation steps• Inter-annotator

agreements• Precision and Recall

metrics

definedcrowd confidential 16

DefinedCrowd combines the best of professional services with SaaS companies

definedcrowd confidential 17

FOLLOW US ON

or send me an email to [email protected]

Learn more at definedcrowd.com