2–5 Jul 2024
Osijek
Europe/Zagreb timezone

The Curse of Popularity: The Role of the Degree in Random Projections for Graph Embeddings

3 Jul 2024, 16:05
20m
D8 (School of Applied Mathematics and Informatics, J. J. Strossmayer University of Osijek)

Trg Ljudevita Gaja 6, Osijek
Talk
PSF: Probability, Statistics and Financial Mathematics

Speaker

Tvrtko Tadić (Microsoft Corporation, Redmond)

Description

Random projections are widely used to generate embeddings for large-graph tasks because they make estimating relevance between vertices computationally efficient. Most applications have been justified by the Johnson-Lindenstrauss lemma. We go a step further and investigate how well random projections preserve the dot product and cosine similarity. Our analysis provides new theoretical results, identifies pathological cases, and tests them in numerical experiments. We find that, for nodes of lower or higher degree, the method produces especially unreliable embeddings for the dot product, regardless of whether the adjacency matrix or the transition matrix (its normalized version) is used. With respect to the noise introduced by random projections, we show that cosine similarity yields markedly more precise approximations. This work builds on many experiments conducted by the Graph Intelligence Sciences team at Microsoft to compute relevance between entities (emails, documents, people, events, ...) in Office 365. Joint work with Cassiano Becker and Jennifer Neville.
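
To make the setting concrete, here is a minimal sketch of the kind of comparison the abstract describes. It is not the authors' code. The toy graph model, the Gaussian projection matrix, and all names (`random_projection_embeddings`, `cosine_matrix`, `demo`, and the parameters `n`, `k`, `seed`) are assumptions made for illustration only. The sketch projects the rows of an adjacency matrix to `k` dimensions and measures how far the projected dot products and cosine similarities are from the exact ones.

```python
import numpy as np


def random_projection_embeddings(A, k, rng):
    """Project the rows of A (n x n) to k dimensions with a Gaussian matrix."""
    n = A.shape[1]
    # Scaling by 1/sqrt(k) gives E[R @ R.T] = I, so dot products are
    # preserved in expectation.
    R = rng.standard_normal((n, k)) / np.sqrt(k)
    return A @ R


def cosine_matrix(X):
    """Pairwise cosine similarity between the rows of X (rows must be nonzero)."""
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / norms
    return Xn @ Xn.T


def demo(n=200, k=256, seed=0):
    rng = np.random.default_rng(seed)

    # Hypothetical toy graph with heterogeneous degrees: vertices i and j
    # are joined with probability w[i] * w[j].
    w = np.linspace(0.05, 0.9, n)
    upper = np.triu(rng.random((n, n)) < np.outer(w, w), k=1)
    A = (upper | upper.T).astype(float)

    # Add a ring so that every vertex has degree at least 1 and the
    # cosine similarity is always defined.
    idx = np.arange(n)
    A[idx, (idx + 1) % n] = 1.0
    A[(idx + 1) % n, idx] = 1.0

    E = random_projection_embeddings(A, k, rng)

    # Absolute errors of the projected similarities against the exact ones.
    dot_err = np.abs(E @ E.T - A @ A.T)
    cos_err = np.abs(cosine_matrix(E) - cosine_matrix(A))
    degree = A.sum(axis=1)
    return degree, dot_err, cos_err


if __name__ == "__main__":
    degree, dot_err, cos_err = demo()
    order = np.argsort(degree)
    low, high = order[:20], order[-20:]
    print("mean dot-product error, 20 lowest-degree rows: ", dot_err[low].mean())
    print("mean dot-product error, 20 highest-degree rows:", dot_err[high].mean())
    print("max cosine error over all pairs:               ", cos_err.max())
```

Running the script prints the errors grouped by degree so they can be inspected directly. The two measures live on different scales (dot products are unbounded, cosines lie in [-1, 1]), so the printed numbers are only a starting point for exploration and do not reproduce the results presented in the talk.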

Primary author

Tvrtko Tadić (Microsoft Corporation, Redmond)
