Developer community and repository data
- Identify top talent
- Stay ahead of competition
- Mitigate investment risks
- Valuable for investment, market research, and HR intelligence
Data Points | Example Values |
---|---|
Username | Andy182 |
Location | Glasgow |
Organization | IBM |
Occupation | Data Scientist |
Follower count | 24 |
Communities | Ask Ubuntu |
Tags | Linux |
Has projects | True |
{
"doc": {
"source_id": 69642661,
"id": "github_people_6964765432661",
"image": "https://avatars.githubusercontent.com/u/6966543542661?v=4",
"bio": null,
"contact_info": {
"blog": "",
"twitter": null
},
"company": null,
"events_url": "https://api.github.com/users/alwin48/events{/privacy}",
"follower_count": 14,
"following_count": 28,
"hireable": null,
"url": "https://github.com/alwin485345",
"location": null,
"username": "alwin485324",
"name": "Alwin Joseph",
"node_id": "MDQ6VXNlcjY5NgfdbsjQyNjYx",
"public_gist_count": 0,
"public_repo_count": 9,
"starred_repos_count": 70,
"site_admin": false,
"type": "User",
"repo": [{
"disabled": false,
"archived": false,
"created_at": "2020-12-13T10:59:42Z",
"default_branch": "main",
"description": "A progresive web app (PWA) which utilizes whitespaces to make text invisible",
"fork": true,
"fork_count": 0,
"forked_from": "https://www.github.com/FOSS-Cell-GECGFDVPKD/Hide-it",
"has_downloads": true,
"has_issues": false,
"has_pages": false,
"has_projects": true,
"has_wiki": true,
"website": "https://hide-it.netlify.app/",
"url": "https://github.com/alwin48532453/Hide-it",
"source_id": 32104543253607,
"language": null
}]
}
What is community & repository data?
Community and repository data allows you to find software projects and the best talent in IT industry. We offer data from coding, programming, web development, app development, software development communities, and more. It provides you with data points such as company name, location, repos summary, script summary, and more. It's parsed, accurate, and ready to use. This data is available as a flat file, delivered in JSON or JSONL.
Top datasets
Community and repository data is divided between multiple datasets, each one corresponding to its respective data source.
GitHub Users
GitHub Users data consists of over 67M records and provides you with data points such as follower count, hireability, URL, location, name, repository counts, and more.
GitHub Branches
GitHub Branches data consists of over 789M records and provides you with data points such as source ID, name, protection, repository name/owner, and more.
GitHub Contributions
GitHub Contributions data consist of over 148M records and provides you with data points such as repository name/owner/URL, author information, number of contributions, and more.
Community and repository data use cases
Target market analysis
Developer community and repository data can provide investors and HR tech companies with valuable insights into market trends and the popularity of certain programming languages, technologies, and tools. This information can help investors and HR tech businesses make informed decisions about their investments and hiring strategies.
Talent sourcing
HR tech companies can leverage developer community and repository data to identify and engage with top talent. By analyzing contributions to open-source projects, HR tech companies can identify developers with the right skills and experience for their organizations.
Competitive analysis
Developer community and repository data helps investors and HR tech companies to research competitor strategies and understand how their investments and talent acquisition strategies compare to others in the market.
Flexible data delivery options
When buying datasets, you can select data formats, delivery methods, and frequency that are convenient for your business.
Why 500+ companies choose Coresignal
Global coverage
The database connected to our APIs consists of data records from across the globe.
In the market since 2016
Our team includes some of the most experienced web data extraction professionals.
Responsible data collection
We only collect publicly available web data, in line with the highest data privacy standards.
Frequently asked questions
A developer community is a place where developers share their projects, knowledge, progress, and advice, among other things.
Coresignals developer community and repository data sources include GitHub and Docker Hub.
You can find tech talent in community and repository data or employee data.
We collect community and repository data from various public web sources and put it into several databases. Different data sources have separate datasets of respective community and repository data records.
Data security is one of the main priorities. We store data in a protected dataset to avoid breaches and leaks of sensitive information.