json: { "language": "en", "orientation": "up",...

35
Update d Agenda 12:00PM–1:00PM | Session #1 1:00PM–2:00PM | Lunch

Upload: carol-harris

Post on 19-Dec-2015

237 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Updated Agenda

12:00PM–1:00PM |

Session #1

1:00PM–2:00PM | Lunch

Page 2: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",
Page 3: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Dr. Harry ShumExecutive Vice PresidentTechnology and Research

Microsoft Project Oxford:Adding Smart to Your Applications

2-613

Ryan GalgonSenior PMMicrosoft Project Oxford

Page 4: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

What is Project Oxford?

A portfolio of REST APIs and SDKs which enable developers to write applications which understand the content within the rapidly growing set of multimedia data

Page 5: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Project Oxford’s API services will help you understand and interact with audio, text, image, and video

Understand the data around your application

Page 6: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Microsoft Project Oxford Services

PROJECT OXFORD

Speech APIs LUIS (Language Understanding Intelligent Service)

Vision APIs Face APIs

Page 7: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Project Oxford’s models are trained using the same deep learning and machine learning techniques that power many products across Microsoft

Powerful models

Page 8: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Project Oxford allows you to focus on your application by easily including these services across platforms through simple REST APIs

Easy to use

Page 9: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Easily include Project Oxford Services

ProjectOxford.Face.Contract.Face[] detectionResults = new ProjectOxford.Face.Contract.Face[0];ProjectOxford.Face.Contract.IdentifyResult[] identifyResults = new ProjectOxford.Face.Contract.IdentifyResult[0];

using (var imageFileStream = Context.ContentResolver.OpenInputStream(imageUri)){

//Call detection and identification REST APIdetectionResults = await client.DetectAsync(imageStream: imageFileStream, analyzesAge: true, analyzesGender: true);

identifyResults = await client.IdentifyAsync(personGroupId, detectionResults.Select(face => face.FaceId).ToArray());}

ProjectOxford.Face.Contract.Face[] detectionResults = new ProjectOxford.Face.Contract.Face[0];ProjectOxford.Face.Contract.IdentifyResult[] identifyResults = new ProjectOxford.Face.Contract.IdentifyResult[0];

using (var imageFileStream = Context.ContentResolver.OpenInputStream(imageUri)){

//Call detection and identification REST API

Page 10: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

OCR

Speech Recognition

Text to Speech

Speech Intent Recognition

Determine Entities

Microsoft Project Oxford Services

PROJECT OXFORD

Face Grouping

Face Identification

Face DetectionAnalyze Image

Generate Thumbnail

Improve Models

Detect Intent

Face APIsVision APIsLUIS(Language Understanding Intelligent Service)

Speech APIs

Page 11: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Vision APIsAnalyze an ImageOCRGet Thumbnail

Page 12: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Understand content and features within an image

Analyze Image Service

Page 13: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Analyze Image – Example

Type of Image:

Clip Art Type 0 Non-clipart

Line Drawing Type 0 Non-Line Drawing

Black & White Image False

Content of Image:

Categories [{ “name”: “people_swimming”, “score”: 0.099609375 }]

Adult Content False

Adult Score 0.18533889949321747

Faces [{ “age”: 27, “gender”: “Male”, “faceRectangle”: {“left”: 472, “top”: 258, “width”: 199, “height”: 199}}]

Image Colors:

Dominant Color Background White

Dominant Color Foreground Grey

Dominant Colors White

Accent Color

Page 14: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Detect and recognize words within a photo

OCR Service

Page 15: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

OCR – Example

JSON:{

"language": "en","orientation": "Up","regions": [

{

"boundingBox": "41,77,918,440","lines": [

{

"boundingBox": "41,77,723,89",

"words": [

{

"boundingBox": "41,102,225,64",

"text": "LIFE"

},

{

"boundingBox": "356,89,94,62",

"text": "IS"

},

{

"boundingBox": "539,77,225,64",

"text": "LIKE"

}. . .

TEXT:LIFE IS LIKERIDING A BICYCLETO KEEP YOUR BALANCEYOU MUST KEEP MOVING

Page 16: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Scale and crop an image, while retaining key content

Smart Thumbnail Service

Page 17: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Smart Thumbnail – Example

Page 18: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Face APIsDetectionVerificationGroupingIdentification

Page 19: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Face API – DetectionDetection Result:JSON:[ { "faceRectangle": { "width": 109, "height": 109, "left": 62, "top": 62 }, "attributes": { "age": 31, "gender": "male", "headPose": { "roll": "2.9", "yaw": "-1.3", "pitch": "0.0" } "faceLandmarks": { "pupilLeft": { "x": "93.6", "y": "88.2" }, "pupilRight": { "x": "138.4", "y": "91.7" }, ...

INPUTIMAGE

FACIALRECTANGLE + LANDMARKS

DETECTION ATTRIBUTES

Page 20: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Verification Result:JSON:[

{"isIdentical":false,"confidence":0.01

}]

Face API – Verification

Given two faces, determine whether they are the same person

Page 21: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

CLUSTERED BY DETECTED PEOPLE

Face API – Grouping

Page 22: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

INPUT IMAGE

DETECTION

FACIALRECTANGLE + LANDMARKS

Face API – Create Person Object

COLLEAGUES

CREATE PERSON GROUP

COLLEAGUES

ADD PERSON

Chao Wang

COLLEAGUES

Chao Wang

Page 23: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

He is Chao Wang.

Face API – Identify

NEW INPUTIMAGE

IDENTIFY

Natalie Huber

GROUP PERSON OBJECTS

COLLEAGUES

RECOGNITION

Chao Wang

Page 24: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Speech APIs powered by BingVoice Recognition (Speech to Text)Voice Output (Text to Speech)

Page 25: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Voice Recognition

Converts spoken audio to text

Same backend which powers Cortana

Support for 7 languages at launch

en-US

en-GBde-DE

es-ESfr-FRit-IT

zh-CN

Page 26: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Duration of Audio < 15 seconds < 2 minutes

Final Result n-best choiceBest Choice, delivered at sentence pauses

Partial Results Yes Yes

********* Final N-BEST Results *********[0] Confidence=Normal Text="450 six St San Francisco."[1] Confidence=Normal Text="For 50 six St San Francisco."[2] Confidence=Normal Text="456th St San Francisco."[3] Confidence=Normal Text="450 six St in San Francisco."[4] Confidence=Normal Text="456 St San Francisco."

Voice Recognition ModesShort Form Long Form

450 6th St. San

Francisco

Page 27: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Voice RecognitionREST API Client Library

Any Windows, Android, iOSSUPPORTED PLATFORMS

Yes YesDATA SUPPORT

No YesMIC SUPPORT

No YesSILENCE DETECTION ON MIC

Short Short and longLENGTH OF UTTERANCE

n-best response backmultiple partial results, n-best (short) and multiple phrases (long)

NUMBER OF RESPONSES

Windows 10 has Speech APIs built in

Page 28: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Synthesize audio from

text, to speak to your users.

Synthesize audio from text via POST request

Maximum audio return of 15 seconds

17 languages supported at launch

Voice Output

<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" xmlns:mstts="http://www.w3.org/2001/mstts" xml:lang="en-US"><voice name="Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)">

Synthesize audio from text, to speak to your users.

</voice></speak>

Page 29: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Language Understanding Intelligent ServiceDetermine IntentDetect EntitiesImprove Models

Page 30: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

{“entities”: [

{“entity”: “flight_delays”,“type”: “Topic”

}],“intents”: [

{“intent”: “FindNews”,“score”: 0.99853384

},{

“intent”: “None”,“score”: 0.07289317

},{

“intent”: “ReadNews”,“score”: 0.0167122427

},{

“intent”: “ShareNews”,“score”: 1.0919299E-06

}]

}

“News about flight delays”

Language Understanding Models

Page 31: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Lets you understand what your users are saying

Seamless integration with Speech Recognition

A few examples are enough to deploy an application

LUIS learns over time

Language Understanding Intelligent Service

Define Concepts

ProvideExamples

Active LearningDeploy

Page 32: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Getting started with Project Oxford

You can also subscribe directly from the Azure Management Portal Marketplace

Visit https://www.projectoxford.ai and select “Sign up” from any of the offered services

The Azure Management portal will launch

In 'Choose an Application or Service' page, select the service such as "Face APIs" from the list

Fill in the requested information to purchase a free tier

Find the service under the ‘marketplace’ tab

Select the service, and click on ‘Manage’

You now have your developer keys, ready to use in your applications

Page 33: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Microsoft Project Oxford

A portfolio of REST APIs and SDKs which enable developers to write applications which understand the content within the rapidly growing set of multimedia data

Page 34: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

Visit http://www.projectoxford.ai to learn more

Stop by the Project Oxford booth

Give us feedback in our forum

Don’t miss the Bing talk on the Vision APIs

Call to Action

Page 35: JSON: { "language": "en", "orientation": "Up", "regions": [ { "boundingBox": "41,77,918,440", "lines": [ { "boundingBox": "41,77,723,89",

© 2015 Microsoft Corporation. All rights reserved.