Title: Toward a Large-scale Database for API Usage Graphs of Java Source Code
Speaker: Ahn Nguyen, ECpE Graduate Student
Advisor: Tien Nguyen, Associate Professor
Abstract: The API usage graph representation of source code has been successfully used for different applications like code recommendation, clone detection and pattern mining. Investigating the characteristics of the APIs and their usages in different scenarios is crucial for the advancement in both programming languages and software engineering. In this seminar, I will discuss about UsageBank, a large-scale data bank of API usages that can be queried on different usage properties of existing APIs. Via mining API usage graphs from a large scale open source code repository, UsageBank can collect 621 million unique usage graphs. Empirical experiments on querying of UsageBank’s usage graphs show promising results and future applications of the data bank.