data sience coffe
TRANSCRIPT
Chat Bot topic summary
TeamFICC Tech macro
Self introduction
• Noboru Kano– 2016 new grad
• Interesting topicsNLP(Natural language processing), Statistics, ML(Machine learning)
• Experience1year part time on a start up company as a ML engineer(Did 3 NLP projects)
• HobbyHandball, Chinese food, drink party(sometimes), 2ch
Agenda
• What is Bot ?• Chat bot history • type of chat bot
algorithm and demo• case study
What is Bot ?
• A computer program that simulates human conversation, or chat, through artificial intelligence.(From wiki)
Why Chat Bot ?
• This year, chat bot has attracted a great deal of public attention.
History of Chat bot
• ELIZA “doctor”(1966)An early example of primitive NLP chat bot
• A simulation of a psychotherapist. On your Emacs
• you can find free source about ELIZA in internet
Type of Chat bot
I bought a computer
I like the computer
Hello
Hello I’m kanono !
1. If-then-eles Type
• If the words in dictionaries, bot can respond to you.• Accuracy depends on the volume of the dictionary
example : ELIZA
Type of Chat bot2. Use dialogue dataStore dialogue data into DB and response a similar message.
• The response would be a human-like message.• Accuracy depends on the volume of the dialogue
Are you free now?
sorry I’m really busy
That too bad...
What happened?
Are:1, you:1, free:1, now:1
dialogue DB
ex : パン太一郎
Type of Chat bot3. Generate model• generate sentence with statistical method.• calculate the next words appearance ratio.
which dessert do you like the best ?
Yogurt is dessert
I really like frozen yogurt
I watched “Frozen” last night
• Can use variety of phrase and expression• Difficult to control context in message
Dialogue corpus
Generate Model
Algorithm for Generate model
1. Markov chainI am John.I am kanono.I do not like English
I
do
am
not
0.66
0.33
John
kanono
0.5
0.5
・・・
Strong in make a short sentence.
not good at generating long story.with large scale corpus data
Demo
What I made?
Input: ImageOut put: generate a sentence related to the image
Hi ! I’m going to take a flight to NY training, see you soon!
技術解説(画像分類部分)
アルゴリズム1 位 aircraft( 飛行機 )2 位 plane ...
上位 10 クラスのスコアを出力→ 日本語に翻訳
ラベル
技術解説(ツイート生成部分)1 位 aircraft (飛行機)
ラベル
ラベルに該当するツイートをたくさん取得♡
・・・
テキストを自動生成(自作)今日は飛行機にのるよー!名古屋みんな待っててね!
ヒミツの自作アルゴリズム♥
Algorithm for Generate model
2. RNN(Reccurent Neural Network) ex.) Allo, りんな
RNN for semantic analysis RNN for generate response
Case study
• Check my Qiita page• http://qiita.com/kanottyan/items/
2783bf91c8ea6a8a4ce8