This is a replacement for #65 which lasted too long to be effective.
Context
Theory
Two papers in MSR 2019 have shown that we can use dependencies graphs to extract information. Namely:
Business
As detailed in the ML e-book, there are multiple business use-cases related to the dependency graph:
- Grouping projects by dependency similarity, either in 2D or answering nearest neighbours queries.
- Finding frequent dependency sets, the same way as retailers determine frequent product baskets.
- Finding competing dependencies.
- Suggesting new dependencies.
- Recommending alternatives.
It is also a strong possibility that we could leverage dependencies graph to group similar developers, and assess expertise in libraries or ecosystems.
Objective
We want to use the dependency graph of PGA to create embeddings for as much libraries as possible, and explore how we could use these to answer the business perspectives described above.
Checklist
This is a replacement for #65 which lasted too long to be effective.
Context
Theory
Two papers in MSR 2019 have shown that we can use dependencies graphs to extract information. Namely:
Business
As detailed in the ML e-book, there are multiple business use-cases related to the dependency graph:
It is also a strong possibility that we could leverage dependencies graph to group similar developers, and assess expertise in libraries or ecosystems.
Objective
We want to use the dependency graph of PGA to create embeddings for as much libraries as possible, and explore how we could use these to answer the business perspectives described above.
Checklist