Week 1
Literature reviews, overview of the project.
Meeting
the idea is to use the dynamic measures to see the graph
for HD features or combined features, using a connected scatter plot seems interesting
narrow down the design, and focus more on the problem itself rather than users
Graph Measures
Measure Level
time: snapshot → period → lifetime
component: local → subgraph → global
"aggregated value" refers to the calculated metric value from an aggregated graph. "series value" means a value sequences for each snapshot during the period. And "derived value" is the derived value from series value, such as extremes, average value, deviance, trend, stability, fastest increase, etc.
Details
Global
Static
count, ratio
number of nodes, active nodes, links, node pairs, connected components, motifs (triads, cliques...), clusters, activation, redundancy
clustering coefficient
extreme
diameter,
average
characteristic path length ( shortest path length), efficiency (1/shortest path)
others
modularity
Dynamic
global volatility
Local
Static
centrality: identify the most important vertex or link in a graph
degree, strength (sum of weight)
eccentricity (max d_ij)
closeness (average shortest path length)
betweenness
others
pagerank, redundancy
Dynamic
volatility: sum presence variance of links to a node
activation: count of new connections
High-Dimensional Features (learned from data)
Deepwalk, Graph2Vec, SDNE, matrix factorization, persistent diagram, adjacent matrix
Last updated