coresignal
Datasets

Professional network data

Leverage our top B2B datasets

Job posting data

Get access to hundreds of millions of jobs

Employee review data

Get data for employee sentiment analysis

Clean dataNEW

Enhanced professional network data

Employee data

Get data on global talent at scale

Funding data

Discover and analyze funding deals

Firmographic data

Unlock a 360° view of millions of companies

Technographic data

Analyze companies’ tech stacks

See all datasets

BY INDUSTRY

MOST POPULAR USE CASES

Pricing
Datasets
Data APIs
Data sources
Use cases
Resources
Pricing
arrow left
arrow right
Home
arrow right
community and repository data

Developer community and repository data

  • Identify top talent
  • Stay ahead of competition
  • Mitigate investment risks
  • Valuable for investment, market research, and HR intelligence
1B+

data records

2

data sources

Systematic

data updates and discovery

8

years in the market

What is community & repository data?

Community and repository data allows you to find software projects and the best talent in IT industry. We offer data from coding, programming, web development, app development, software development communities, and more. It provides you with data points such as company name, location, repos summary, script summary, and more. It's parsed, accurate, and ready to use. This data is available as a flat file, delivered in JSON.

Dictionary
JSON
Data Points Example Values
Username Andy182
Location Glasgow
Organization IBM
Occupation Data Scientist
Follower count 24
Communities Ask Ubuntu
Tags Linux
Has projects True

What is community & repository data?

Community and repository data allows you to find software projects and the best talent in IT industry. We offer data from coding, programming, web development, app development, software development communities, and more. It provides you with data points such as company name, location, repos summary, script summary, and more. It's parsed, accurate, and ready to use. This data is available as a flat file, delivered in JSON.

Top datasets

Community and repository data is divided between multiple datasets, each one corresponding to its respective data source.

GitHub

GitHub Users

GitHub Users data consists of over 67M records and provides you with data points such as follower count, hireability, URL, location, name, repository counts, and more.

See more

GitHub Branches

GitHub Branches data consists of over 793M records and provides you with data points such as source ID, name, protection, repository name/owner, and more.

GitHub Contributions

GitHub Contributions data consist of over 149M records and provides you with data points such as repository name/owner/URL, author information, number of contributions, and more.

Other community and repository data sources

Community and repository data use cases

Target market analysis

Developer community and repository data can provide investors and HR tech companies with valuable insights into market trends and the popularity of certain programming languages, technologies, and tools. This information can help investors and HR tech businesses make informed decisions about their investments and hiring strategies.

Talent sourcing

HR tech companies can leverage developer community and repository data to identify and engage with top talent. By analyzing contributions to open-source projects, HR tech companies can identify developers with the right skills and experience for their organizations.

Competitive analysis

Developer community and repository data helps investors and HR tech companies to research competitor strategies and understand how their investments and talent acquisition strategies compare to others in the market.

Have another use case in mind?

Contact our sales team and we will do our best to help you.

Flexible data delivery options

When buying datasets, you can select data formats, delivery methods, and frequency that are convenient for your business.

But don't take us at our word. Listen to our clients.

Find more reviews on Datarade.

Start Quote

We are using Coresignal to enrich our AI platform for Sales Pipeline Growth. We proactively recommend sales-ready opps, interested buyers, warm intros, and trusted actions, which results in +25% in net new pipeline in 2 months, and +40% after 6 months.

Lead generation client

Before we started working with Coresignal, the percentage of investments that we made that had data influence was around 2% and currently it's around 65%.

Venture capital client

Coresignal has strong demographic and firmographic datasets both on quality and volume while keeping the data as fresh as it can be. We've been using Coresignal for years and we can only speak highly about the product and the team behind it. Highly recommended.

Venture capital client

End Quote

Find more reviews on Datarade.

Why 400+ companies choose Coresignal

Reliable and convenient delivery

We offer data in multiple formats, flexible delivery frequency and ensure transparent information about data operations to our clients.

people

Exceptional client support

Get the most out of your data with the help of Coresignal's dedicated account managers. We value long-term relationships and strive to provide quick support.

continuity

8 years in the market

Our team includes some of the most experienced web data extraction professionals. The advanced infrastructure they built over the years allows us to expand our datasets daily.

contact us

Stay ahead of the game with fresh web data

Coresignal's data helps companies achieve their goals

Frequently asked questions

What is a developer community?

A developer community is a place where developers share their projects, knowledge, progress, and advice, among other things.

What are Coresignal's developer community and repository data sources?

Coresignals developer community and repository data sources include GitHub and Docker Hub.

Where to find tech talent?

You can find tech talent in community and repository data or employee data.

How is community and repository data collected?

We collect community and repository data from various public web sources and put it into several databases. Different data sources have separate datasets of respective community and repository data records.

Who uses community and repository data?

Coresignal’s community and repository data is being used by investors and HR platforms that use it to generate investment signals and source talent.

How secure is the data?

Data security is one of the main priorities. We store data in a protected dataset to avoid breaches and leaks of sensitive information. 

Company

Unlock new opportunities with Coresignal.

Follow us on social media

LinkedInX

Terms and conditions

Coresignal © 2024 All Rights Reserved