Distributed Memory Processing of Very Large Graphs

dc.contributor.advisorNorris, Boyana
dc.contributor.authorRiazi, Sara
dc.date.accessioned2020-02-27T22:35:15Z
dc.date.available2020-02-27T22:35:15Z
dc.date.issued2020-02-27
dc.description.abstractBig graphs such as social networks or the internet network, biological networks, knowledge graphs appear in many domains. However, processing these graphs rely on the accessibility of high-performance frameworks which are able to handle these large graphs. One aspect of this accessibility is the usability of the frameworks for a broad community of researches who do not have sufficient expertise to work with these frameworks. To address this issue, we introduce GraphFlow framework, a workflow-based framework that provides several graph mining components. GraphFlow benefits from data-parallel Apache Spark and its GraphX library, as the back-end, so it processes very large graphs. GraphFlow also supports the construction of experiment pipelines that involve running several components. Integrated into our GraphFlow framework, we also introduce a novel vertex-centric network embedding algorithm, which can learn low-dimensional vectors for vertices of very large graphs. Our network embedding algorithm can scale to graphs with billions of edges, while previous algorithms do not scale to the graphs of this scale. GraphFlow also supports dynamic graphs using graph snapshots and batch updates. We provide SSSPIncJoint, a novel algorithm for computing single-source shortest paths (SSSP) for dynamic graphs. SSSPIncJoint is significantly more efficient than running SSSP for each snapshot of a dynamic graph.en_US
dc.identifier.urihttps://hdl.handle.net/1794/25260
dc.language.isoen_US
dc.publisherUniversity of Oregon
dc.rightsAll Rights Reserved.
dc.subjectApache Sparken_US
dc.subjectBig Graphsen_US
dc.subjectDistributed Memoryen_US
dc.subjectGraphFlowen_US
dc.titleDistributed Memory Processing of Very Large Graphs
dc.typeElectronic Thesis or Dissertation
thesis.degree.disciplineDepartment of Computer and Information Science
thesis.degree.grantorUniversity of Oregon
thesis.degree.leveldoctoral
thesis.degree.namePh.D.

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Riazi_oregon_0171A_12622.pdf
Size:
3.55 MB
Format:
Adobe Portable Document Format