big data analytics and visualization - mta cloud · 2017-02-20 · big data analytics and...
TRANSCRIPT
![Page 1: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/1.jpg)
Big Data analytics
and
Visualization
MTA Cloud symposium
A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard
CERN
MTA Head quarters, Budapest, 17 February 2017
1
![Page 2: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/2.jpg)
Background information
• Collaboration Spotting (CS) Platform (V2) used to process examples
• CS is a Visual Analytics tool originally developed to analyse the technology landscape of key enabling technologies for the Particle Physics programme at CERN • Using Publications and Patent metadata
• The CS Platform has been used to visualize other datasets: • CERN procurement data
• Ceased assets in collaborations with the UN-UNCRI
• Neuro-science data in collaboration with Wigner
CS Platform V3 2
![Page 3: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/3.jpg)
Characteristics of Big Data
• Huge quantity
• Distributed sources
• Complexity
• Interconnectivity
• Processing and storage
• Access rights, security
• Valuable information may be
hidden behind complexity
• Unravelling new knowledge
Data scientists are instrumental to analytics
Domain experts are at the heart of the reasoning process 3
![Page 4: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/4.jpg)
Big Data is organised in networks
• Document systems with metadata in Database
• Database tables with metadata in schema
Big Data is distributed
• Connectivity not materialised due to the distributed nature of data sources
• Connectivity relates to the understanding of the data
Big Data is strongly
interconnected
4 Technical challenges of using Big Data analytics
![Page 5: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/5.jpg)
Big Data Intrinsic vs additional value
• The additional value of Big Data comes from
its interconnectivity
Technical challenges of using Big Data analytics 5
Relational DBMS
Discrete data
No-SQL
Connected data
Graph DB
Conventional analytics Conventional + visual analytics
![Page 6: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/6.jpg)
Two Criteria:
Bottom-up VS Top Down
Discrete data VS highly interconnected data
6 Technical challenges of using Big Data analytics
![Page 7: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/7.jpg)
Top-Down VS Bottom-up
Typically hard sciences Empirical approach
• Process driven
• Hypothesis
• Simulation software
• Validation with real data
• Review hypothesis
• Experiments
• Compare results with
simulation
• Data driven
• Extract features from data
• Generate hypothesis
• Run what-if scenario
• Validate with data
• Big Data
• Software for domain expert to
make sense out of Big Data
Technical challenges of using Big Data Analytics 7
Relational DBMS
Discrete data
No-SQL
Connected data
Graph DB
Domain Expert
![Page 8: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/8.jpg)
8
Domain expert vs Data scientist
Source: JIOX: Intelligence Tradecraft & Analysis
Domain expert
Domain expert
Software developer
Software developer
Cycle is managed by
Data Scientist
![Page 9: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/9.jpg)
9
• Experts have the knowledge
• Data scientists have the skills
Data scientists to build platforms that enable experts to
perform analytics by themselves
Domain expert
• Bring analytics to experts • “Understand” results of analytics
• “Instruct” computers to perform
analytics according to findings
Challenge Bring domain experts
at the centre of the visual analytics cycle
![Page 10: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/10.jpg)
What is required?
• Support interconnectivity
• Support Cross Domain applications
Network Data and Domain
independent
• Support any combination of data sources
• Support any combination of data structures
Scalable and flexible
• Support visualisation of network content
• Support visualisation of analysis results
Easily accessible and navigable to
Experts
• Support navigation of network content
• Support queries of network content
Enhance value of Data Network for
Experts
Technical challenges of using Big Data analytics 10
Smart Data management
concepts and tools
Smart graphic management
concepts and tools
A Domain independent platform
![Page 11: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/11.jpg)
Smart Data Management
• Complexity
• Interconnectivity
• Scalability
• Multi dimensional
Directed graphs are natural representations
of large and interconnected
datasets
• Nodes’ labels
• Compact graph structure
• Graph query language
• No schema evolution
Schema is embedded in the data
• Schema: labels and edges (interconnectivity)
• Labels Graph dimensions
• Edges Directed relationships between Labels
• Data graph: vertices and edges
• Vertices: data instances and dimension instances
• Edges: Directed relationships between vertices
Graphs of connected elements constitute multi-dimensional
networks
Graph Databases offer a natural support for storing network information
Label property graph data model 11
![Page 12: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/12.jpg)
Building a Network from two data
sources (Pub/Pat)
Document metadata Graph of data types
SCat: Journal category, Kw: Keyword, Org: Organisation,
Cny: Country, Tech: Technology 12 Technical challenges of using Big Data analytics
![Page 13: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/13.jpg)
Graph of Network
T: Technologies, A: Pub/Pat, K: keywords, O: Organisations, C: Countries 13
![Page 14: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/14.jpg)
Data Graph & Graph Schema
14 Technical challenges of using Big Data analytics
Reachability Graph Graph of data network
![Page 15: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/15.jpg)
Building multi-dimensional networks
File Systems Tables in
DB
Graphs in
DB
Processing/Populating/Labelling/Organising
Multi-dimensional
Network in
GraphDB
No limitations on the sixe of a network! 15 Technical challenges of using Big Data analytics
![Page 16: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/16.jpg)
Combining data sources Enriching
networks More interconnectivity
No limitations on the extension of the network’s schema!
Schema: Graph of datatypes/labels
Dimension: a datatype i.e. a node in the graph schema
• Data sources
• Publications/Patents
• Citations
• Institutions/Companies
• Data sources
• EU projects
• Financial data
• Geolocation data
16
![Page 17: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/17.jpg)
17
Smart Graphic Concepts and
Management tools
• Retain complexity
• Singularities
• Clusters/communities/patterns
Graphs are excellent for visualising networks
• Vertex label, shape, size and colour to visualise properties of datasets
• Edges colours to highlight clusters
Graphs contain many visual information
Visualisation enhances the perceptual reasoning potential of analytics
![Page 18: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/18.jpg)
Smart Graphic Concepts and
Management tools(2)
• Selecting network dimensions
• Traversing network dimensions
• Graphical queries
• Time/Frequency evolution
Maximizing human
understanding
• Viewing multiple data sources
• Looking for collaborations
• Sorting communities
• Contextual visualisation & analytics
Enhancing reasoning
18 Technical challenges of using Big Data analytics
![Page 19: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/19.jpg)
Selecting Visualisation dimensions
(Pub) (Pat)
(Org) (Kw)
(Cny)
(Tech)
(SCat)
Reference dimensions for Analytics Pub: Publications, Pat: Patents (Attributes: Title and abstract are used for semantic searches)
Visualisation dimensions of Analytics results: SCat: Journal category, Kw: Keyword, Org: Organisation and Cny: Country)
Technology Search: Czochralski Silicon wafer
Pub/Pat: documents found in search results 19
![Page 20: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/20.jpg)
Technology Search: Czochralski Silicon wafer
Pub/Pat: documents found in search results 20
(SCat) (Org)
Traversing Dimensions
(Pub) (Pat)
(Kw)
(Cny)
(Tech)
(SCat) (Org)
(Cny)
(Kw)
![Page 21: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/21.jpg)
How to scale up the “graph” approach for very large multi-
dimensional networks?
21 Technical challenges of using Big Data analytics
![Page 22: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/22.jpg)
Visual analytics features
• Visual analytics does not replace Big Data analytics Visualize results
• Maintain visual perception quality and user interactivity
• No matter the size
• No matter the diversity (dimensions)
• No matter the interconnectivity
Data sampling & filtering
Visualize subsets of network dimensions View data from different perspectives
22 Technical challenges of using Big Data analytics
![Page 23: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/23.jpg)
Visual analytics Needs
• Visualize part of data network with respect to
particular references and from different perspectives
• Reference: Data dimensions (labels)
• Perspective: Visual dimensions (labels)
• Need to navigate across visual dimensions
• => Visual queries
• Need to get contextual statistics
• In the context of a particular view
• Need to change Data Reference while navigating
• Queries adapted to change of reference
23 Technical challenges of using Big Data analytics
![Page 24: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/24.jpg)
Visual analytics Features (2)
• Structural vs Behavioural • Understand from the data how something is working
• Visualization • Maximum number of collaborations that can be processed
(~100k) to feed visualization
• Maximum number of vertices and edges one can visualize within a graph (~ 10k)
• Maximum number of Clusters one can visualize within a graph (~10k)
• Data quality • Can the data be trusted?
• How complete is the dataset under study?
24 Technical challenges of using Big Data analytics
![Page 25: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/25.jpg)
Visual analytics Needs(2)
• Need to visualize processes, interactions in addition to structure of data network • Connectivity graphs AND
• Causality graphs directed edges
• For large graphs: • Replace vertices with communities in complex graphs
• Compound graph approach
• For graphs built out of large collaborations • Replace 2-adic calculations with m-adic
• Example • Neuro science: paths of length 2 to visualize
input/process/target flows
25 Technical challenges of using Big Data analytics
![Page 26: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/26.jpg)
Reduce visual complexity & faster graph
processing: Hyperedges vs edges
Edges vs hyper-edges Technology search: BGO Crystals
Pub/Pat: documents found in search results
Organisation landscape hypergraph view Organisation landscape graph view
26
![Page 27: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/27.jpg)
Tailor visualisation to data
• STRATEGY: Combining various techniques to
support quality visual perception and user
interactions according to data and graph sizes
• Statistics
• Data sampling & Reduction
• Compound graphs
• 2-adic vs n-adic node-link graph representation
27 Technical challenges of using Big Data analytics
![Page 28: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/28.jpg)
Combining techniques for visualisation
28
Visual Data
Analysis:
Compute
Collaborations
#
collabo-
rations
too
large?
Community
Processing:
Compute Compound
Graph
Can
graph
be
layered
?
Visual Data
Analysis:
Build statistics
No
Yes
Yes
No
Visual Data
Analysis:
Filter/Reduce
dataset
Objective: Reduce dataset and graph content with very minimal loss in visual perception
#
clusters
too
large?
No
Yes
#
vertices
too large
Visual Analysis:
Display Graph with
vertices
Visual Analysis:
Display Graph with
clusters as vertices
Yes No
Visual Data
Analysis:
Data
Reduction/Sampling
Visualize
Data
anyway?
Yes No
Community
Processing:
Compute clusters
![Page 29: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/29.jpg)
Computing requirements for visualization
• Service users within a few seconds
• Heavy computing at the backend to process clusters, optimize layout and support visual navigation
• Need for Cloud computing
• Using machines with 4 CPU cores (8 threads), 8 GB of memory
• CPU vs GPU
• Comparing them using consumer level hardware (Intel Core i7, GeForce GTX 980)
CS Platform V3 29
![Page 30: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/30.jpg)
Computing requirements for visualization
• Computation on the CPU
• Graphs with tens of thousands of nodes and hundreds of thousands of edges, computing requires ~17 seconds.
• Further optimization can be achieved by further distributing the computation among multiple machines
• Computation on the GPU
• Same graphs compute ~8 times faster (~2 seconds)
• Distribution among multiple GPUs is a further possible optimization
CS Platform V3 30
![Page 31: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/31.jpg)
Computing requirements for visualization
0
5
10
15
20
25
CT 3D Database Silicon
Run
tim
e (
s)
Computation time on CPU vs GPU
CPU GPU
CS Platform V3 31
![Page 32: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/32.jpg)
The macaque case
32
g0: directed graph of brain area
interconnectivity*
(42 vertices = areas, 601 edges= interactions)
*Data/slide: L. Négyessy, A. Fülöp
g2: directed graph of cortical interactions*
(Input/Processing/Target)
(9869 vertices = IPT flows, 166219 edges = common
interactions)
g2 is too large for
visual perception
Communities
172 clusters
10668 edges
Technical challenges of using Big Data analytics
![Page 33: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/33.jpg)
Constructed Reachability Graph
33 Macaque brain network data: optimal for navigation
Brain
Area
Cerebral
lobe
Modality
L2_path ProcessType
InterLobe
g0 edges
g2 edges
g2 g0 connections
Input Area
Processing Area
Target Area
Are “type of processing” and “Interactive lobe”
- Vertex attributes?
- Visual dimensions?
![Page 34: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/34.jpg)
g0 graph
Technical Challenge of Using Big Data Analytics 34
![Page 35: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/35.jpg)
g2 (with intercluster edges)
CS platform concepts V3 35
![Page 36: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/36.jpg)
CS platform concepts V3 36
g2 (paths of length 2)
![Page 37: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/37.jpg)
Community_61 Egocentric
37
![Page 38: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/38.jpg)
CS platform concepts V3 38
Community_61
![Page 39: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/39.jpg)
39
Community_61
![Page 40: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/40.jpg)
CS platform concepts V3 40
![Page 41: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/41.jpg)
Conclusion
• To visualize Big Data Analytics output you need: • Graphs to store your data networks and their schema
• Graphs to view network structure through selected dimensions
• Graphs to navigate across dimensions to provide contextual data to visualisation tools
• To maintain visual perception you need to combine various techniques • Statistics, sampling, compound graph, layered graph
• To support structural and behavioural visualisation you need to explore • Clustering algorithms supporting directed edges
• Processes, interactions in relation with the data
41 Technical challenges with Big Data Analysis
![Page 42: Big Data analytics and Visualization - MTA Cloud · 2017-02-20 · Big Data analytics and Visualization MTA Cloud symposium A. Agocs, D. Dardanis, R. Forster, J.-M. Le Goff, X. Ouvrard](https://reader030.vdocuments.us/reader030/viewer/2022041108/5f0c51057e708231d434cbcf/html5/thumbnails/42.jpg)
Thank you for your attention!