October 15, 2020
With a market size of 1.06 billion in 2019 according to Grand View Research, the alternative data space is becoming a necessity for investors, analysts, and hedge fund managers alike. But why is alternative data on the rise? The answer isn’t as straightforward as one would think. Alternative data compiled with traditional data offers investors and analysts business insights that are both predictive and reliable. This article will explain the essentials of alternative data as well as its importance to investors and hedge funds.
So, what exactly is alternative data? Alternative data refers to data that exists beyond the sphere of traditional data sources. These alternative sources are utilized by investors, analysts, and hedge funds to aid in company and investment evaluation. Some brief examples of alternative data include public social media/sentiment, satellite and weather sensor data, transactional data from credit and debit cards, email receipts, survey data, and much more.
In increasingly data-driven financial markets, alternative data provides investors with fresh sources for enhancing their alpha. Alpha, also known as an active return on investment, measures the performance of an investment based on an appropriate market index. Ultimately, alternative data provides investors with robust data insights for their investments, increasing their active return on investment.
Alternative data sometimes referred to as exhaust or external data, involves data generated by three sources: individuals, business processes, and sensors. Whereas traditional data is exclusive to data that’s produced by companies themselves, alternative data relies on the collection and scraping processes. For example, traditional data can be things such as investor presentations, SEC filings, financial statements, and press releases.
While both alternative and traditional data provide financial insights into a given company, the progress surrounding the alternative data sector provided investors and hedge funds with a more complete picture of short-term and long-term investments and business decisions.
Over the past decade, alternative data sources have grown significantly. In its infancy, alternative data sources included credit card transactions, web-scraped data, and geolocations. As alternative data continues to prove its success, along with the advent of new data sources, experts in alternative data are finding new ways to retrieve these datasets as well as analyze them.
Alternative data, sometimes referred to as exhaust or external data, involves data generated by three sources: individuals, business processes, and sensors.
According to Forbes, alternative data has expanded to include a taxonomy of 24 different types of data. Beyond these 24 data types, alternative data, as mentioned earlier, is generated from three main sources: individuals, business processes, and sensors. Let’s take a more detailed look at what they entail.
While individuals provide large amounts of robust qualitative data, alternative data generated by individuals also proves to be highly unstructured and often difficult to sift through. For this reason, it’s important that datasets consisting of individual data are well-structured and that only the essential data points get picked. Some individual data sources include:
Unlike individual-driven data, businesses tend to produce structured data that provides investors with comprehensive financial insights for a given business. Some business process data sources include:
Another largely unstructured data type, data generated from sensors, is expanding in the alternative data field. Due to the interconnectivity of devices and the development of sensor-based technology analysts and investors alike find this data extremely valuable. Some sensor data sources include:
As mentioned previously, alternative data is acquired through web-scraping, acquisition of raw data, and third-party licensing.
Web-scraping processes unstructured data from web pages and online sources. Through data parsing methods, the data is transformed into structured and readable CSV and JSON formats ready for interpretation by analysts and investors. This is the sort of data that Coresignal offers.
Raw data acquisition is the process of obtaining sizable amounts of unprocessed data, obtained from multiple source types. This data type contains lots of noise and requires significant processing in order to be analyzed and utilized by investors.
Third-party licensing refers mainly to recovered financial data involved in transactions. Companies recover said transactional data like POS data, credit card transactions, etc., and package it in easy-to-read formats for investors.
Due to alternative data’s multiplicity of use cases, many professionals have found a use for alternative data. Professionals that utilize alternative data the most are algorithmic traders, also known as quants, who use the datasets they acquire to construct computer models for trade. However, alternative data is also utilized by hedge funds, analysts, and other institutional investment professionals.
Alternative data covers a broad range of factors that influence a company’s alpha, making alternative data sets a commodity for predicative investing. Specifically, investors consider factors such as scarcity, granularity, history, structure, and coverage when choosing what types of alternative data sets to invest in.
Hedge funds utilize some of the most well-known alternative data sets, including social sentiment, credit card transactions, and web data. These data types provide hedge funds with competitive intelligence and risk management insights due to their pervasiveness in the alternative data space. Additionally, hedge funds find alternative data favorable due to the accessibility of alternative data compared to traditional data, which sometimes takes longer (due to legal processes) for official figures.
Further, some widely used data sets such as credit card transactions and traffic patterns, are important to both private equity firms and hedge funds. On one hand, private equity investment decisions tend to focus more on the long-term, while on the other hand hedge funds focus on both long-term and short-term. Both hedge funds and private equity firms benefit from alternative data because alternative provides robust short-term and long-term insights.
Coresignal offers alternative data from 20 unique sources in 8 categories, scraped from various public web sources. Our up-to-date, raw data from data categories such as firmographics, technographics, job postings data, and public resume data can help you unlock powerful business and investing insights. To learn more information about the data we offer, check out our alternative data page.
Whether you are an investor, an analyst, or an algorithmic trader, understanding the untapped potential of alternative data sets is paramount. Based on the proven success of alternative data, it seems like the sky's the limit to what a business can achieve by leveraging it.
There are 24 different types of alternative data. The most common include firmographic data, social media data, and sensor data.
Firmographic data is collected from web sources and contains information that pertains to a company's internal structure and external operations.
Social media data is collected from social media platforms such as networking sites, chat sites, review sites, etc.
Sensor data is collected by a sensor, such as geo-location, satellites, POS data, and CCTV data.