semi-supervised structuring of complex data...marian-andrei rizoiu semi-supervised structuring of...

8
Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Semi-Supervised Structuring of Complex Data Marian-Andrei RIZOIU Beijing, China August 5 th , 2013 IJCAI 2013 ERIC Laboratory, University Lumière, Lyon, France

Upload: others

Post on 23-Sep-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Semi-Supervised Structuring of Complex Data

Marian-Andrei RIZOIU

Beijing, China

August 5th, 2013IJCAI 2013

ERIC Laboratory, University Lumière, Lyon, France

Page 2: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

The context of my PhD work - complex data

Title (structure)

Hymn (audio)

Flag and coat of arms(image)

Description (text)

Hyperlinks (external knowledge

sources)Map

(image)

Indicators (numerical format)

Specificities:

Different types of dataAdditional information

Temporal dimension / dynamic data

High dimensionality

Diverse / distributed sources

2

Page 3: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

My approach

3

extract knowledge from complex data, often in an unsupervised context

General objective:

add semantics in data analysis

leverage available side-information (supervision)

Research challenges

Leveraging semantics when dealing with complex data

Leveraging the temporal dimension of complex data

translating data into a semantic-aware representation space

injecting knowledge into machine learning algorithms

Page 4: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

The schema structuring my work

3

Page 5: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

Applications: Detecting temporal evolution patterns in a population of entities recorded over a period of time

4

Page 6: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

{ groups, sky, tree, building, street }

{ sky building tree ∧ ∧ ∧ forest,sky groups street ∧ ∧ }

{ groups, water, cascade, sky, tree,

lawn, forest }

{ groups ∧ street ∧ interior ,water cascade tree forest,∧ ∧ ∧

sky ∧ building ∧ panorama }

Applications: Construct interpretable features, which capture better the semantics of the dataset and reduce feature correlation

5

Page 7: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

Expression cloud for each topic

Temporal evolution of topics

Evolution of the popularity of a topic

Social Network modeled as a multi-graphVertexes: the users ; Arcs: the comments associated with topics

Based on the structural citation relation

Applications: CommentWatcher, an open-source web-based platform for analyzing discussions on web forums

6

Page 8: Semi-Supervised Structuring of Complex Data...Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data Context Some applications My approach 3 extract knowledge from complex

Marian-Andrei Rizoiu Semi-Supervised Structuring of Complex Data

Context Some applications

Thank you !