Developer community and repository data

  • Identify top talent
  • Stay ahead of competition
  • Mitigate investment risks
  • Valuable for investment, market research, and HR intelligence
1B+ data records
2 data sources
Systematic data updates and discovery
9 years in the market
Data point        Example value
Username          Andy182
Location          Glasgow
Organization      IBM
Occupation        Data Scientist
Follower count    24
Communities       Ask Ubuntu
Tags              Linux
Has projects      True
{
	"doc": {
		"source_id": 69642661,
		"id": "github_people_6964765432661",
		"image": "https://avatars.githubusercontent.com/u/6966543542661?v=4",
		"bio": null,
		"contact_info": {
			"blog": "",
			"twitter": null
		},
		"company": null,
		"events_url": "https://api.github.com/users/alwin48/events{/privacy}",
		"follower_count": 14,
		"following_count": 28,
		"hireable": null,
		"url": "https://github.com/alwin485345",
		"location": null,
		"username": "alwin485324",
		"name": "Alwin Joseph",
		"node_id": "MDQ6VXNlcjY5NgfdbsjQyNjYx",
		"public_gist_count": 0,
		"public_repo_count": 9,
		"starred_repos_count": 70,
		"site_admin": false,
		"type": "User",
		"repo": [{
			"disabled": false,
			"archived": false,
			"created_at": "2020-12-13T10:59:42Z",
			"default_branch": "main",
			"description": "A  progresive web app (PWA) which utilizes whitespaces to make text invisible",
			"fork": true,
			"fork_count": 0,
			"forked_from": "https://www.github.com/FOSS-Cell-GECGFDVPKD/Hide-it",
			"has_downloads": true,
			"has_issues": false,
			"has_pages": false,
			"has_projects": true,
			"has_wiki": true,
			"website": "https://hide-it.netlify.app/",
			"url": "https://github.com/alwin48532453/Hide-it",
			"source_id": 32104543253607,
			"language": null
		}]
	}
}

What is community & repository data?

Community and repository data helps you find software projects and the best talent in the IT industry. We offer data from coding, programming, web development, app development, and software development communities, among others. Each record includes data points such as company name, location, repository summary, and script summary. The data is parsed, accurate, and ready to use, and it is available as a flat file delivered in JSON or JSONL.
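As a rough illustration of working with a JSONL delivery, the sketch below reads one JSON object per line and pulls out a few fields. It assumes each line wraps the profile in a "doc" object, as in the sample record above; the two inline records and the read_profiles helper are made up for this example and are not part of any Coresignal tooling.

```python
import io
import json

# Two made-up records shaped like the sample record above. A real delivery
# would be a file on disk with one JSON object per line.
SAMPLE_JSONL = io.StringIO(
    '{"doc": {"username": "user_a", "follower_count": 14, '
    '"public_repo_count": 9, "location": null}}\n'
    '{"doc": {"username": "user_b", "follower_count": 3, '
    '"public_repo_count": 0, "location": "Glasgow"}}\n'
)


def read_profiles(lines):
    """Yield a flat summary dict for each non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        doc = json.loads(line).get("doc", {})
        yield {
            "username": doc.get("username"),
            "followers": doc.get("follower_count", 0),
            "public_repos": doc.get("public_repo_count", 0),
            "location": doc.get("location"),
        }


profiles = list(read_profiles(SAMPLE_JSONL))
for profile in profiles:
    print(profile)
```

To read a delivered file instead, pass an open file handle to read_profiles in place of the in-memory sample.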

Top datasets

Community and repository data is divided between multiple datasets, each one corresponding to its respective data source.

GitHub Users

GitHub Users data consists of over 67M records and provides you with data points such as follower count, hireability, URL, location, name, repository counts, and more.

GitHub Branches

GitHub Branches data consists of over 789M records and provides you with data points such as source ID, name, protection, repository name/owner, and more.

GitHub Contributions

GitHub Contributions data consists of over 148M records and provides you with data points such as repository name/owner/URL, author information, number of contributions, and more.

Other community and repository data sources
Docker Hub

Community and repository data use cases

Developer community and repository data can provide investors and HR tech companies with valuable insights into market trends and the popularity of certain programming languages, technologies, and tools. This information can help investors and HR tech businesses make informed decisions about their investments and hiring strategies.
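One simple way to gauge language popularity from such records is to count the "language" field across each profile's "repo" list. The sketch below assumes records shaped like the sample above; the three inline records and the language_counts helper are invented for illustration.

```python
from collections import Counter

# Made-up records that follow the sample structure: a "doc" object with an
# optional "repo" list whose items carry a "language" field (possibly null).
records = [
    {"doc": {"username": "user_a",
             "repo": [{"language": "Python"}, {"language": None}]}},
    {"doc": {"username": "user_b",
             "repo": [{"language": "Python"}, {"language": "Go"}]}},
    {"doc": {"username": "user_c"}},  # no repositories listed
]


def language_counts(records):
    """Count repositories per language, skipping null or empty values."""
    counts = Counter()
    for record in records:
        for repo in record.get("doc", {}).get("repo") or []:
            language = repo.get("language")
            if language:
                counts[language] += 1
    return counts


counts = language_counts(records)
print(counts.most_common())
```

Repository counts are only one proxy for popularity; weighting by stars or contributions may suit other questions better.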

Industry/company benchmarking

Talent sourcing

HR tech companies can leverage developer community and repository data to identify and engage with top talent. By analyzing contributions to open-source projects, HR tech companies can identify developers with the right skills and experience for their organizations.
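As a minimal sketch of this kind of screening, the example below shortlists developers who own at least one original (non-fork) repository in a target language, then ranks them by repository count and follower count. The field names follow the sample record above; the candidate data, the shortlist helper, and the ranking rule are assumptions made for illustration only.

```python
# Made-up candidates shaped like the sample record above.
candidates = [
    {"doc": {"username": "user_a", "follower_count": 5,
             "repo": [{"fork": False, "language": "Python"},
                      {"fork": False, "language": "Python"}]}},
    {"doc": {"username": "user_b", "follower_count": 50,
             "repo": [{"fork": True, "language": "Python"},
                      {"fork": False, "language": "Python"}]}},
    {"doc": {"username": "user_c", "follower_count": None,
             "repo": [{"fork": False, "language": "Go"}]}},
]


def shortlist(records, language, min_repos=1):
    """Return (username, original_repo_count, followers), best first."""
    results = []
    for record in records:
        doc = record.get("doc", {})
        # Keep only repositories the user created in the target language.
        original = [
            repo for repo in doc.get("repo") or []
            if not repo.get("fork") and repo.get("language") == language
        ]
        if len(original) >= min_repos:
            followers = doc.get("follower_count") or 0  # treat null as 0
            results.append((doc.get("username"), len(original), followers))
    # Rank by original repository count, then by follower count.
    results.sort(key=lambda item: (item[1], item[2]), reverse=True)
    return results


python_shortlist = shortlist(candidates, "Python")
print(python_shortlist)
```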

Competitive analysis

Developer community and repository data helps investors and HR tech companies research competitor strategies and understand how their own investment and talent acquisition strategies compare to others in the market.

Flexible data delivery options

When buying datasets, you can select data formats, delivery methods, and frequency that are convenient for your business.

But don’t take us at our word.
Listen to our clients.

Find more reviews on Datarade.

"We are using Coresignal to enrich our AI platform for Sales Pipeline Growth. We proactively recommend sales-ready opps, interested buyers, warm intros, and trusted actions, which results in +25% in net new pipeline in 2 months, and +40% after 6 months."

Lead generation client

"Before we started working with Coresignal, the percentage of investments that we made that had data influence was around 2% and currently it's around 65%."

Venture capital client

"We chose Coresignal because of the coverage, data freshness, and ability to extend to other data sources."

Sales tech client

Why 500+ companies choose Coresignal

Global coverage

The database connected to our APIs consists of data records from across the globe.

In the market since 2016

Our team includes some of the most experienced web data extraction professionals.

Responsible data collection

We only collect publicly available web data, in line with the highest data privacy standards.

Frequently asked questions

What is a developer community?

A developer community is a place where developers share their projects, knowledge, progress, and advice, among other things.

What are Coresignal's developer community and repository data sources?

Coresignal's developer community and repository data sources include GitHub and Docker Hub.

Where to find tech talent?

You can find tech talent in community and repository data or employee data.

How is community and repository data collected?

We collect community and repository data from various public web sources and put it into several databases. Different data sources have separate datasets of respective community and repository data records.

Who uses community and repository data?

Investors and HR platforms use Coresignal’s community and repository data to generate investment signals and source talent.

How secure is the data?

Data security is one of our main priorities. We store data in a protected dataset to avoid breaches and leaks of sensitive information.