DATA SCHEMA

Last Updated: April 15, 2026

Blog Image 01

Introduction

This page provides a complete technical reference for every dataset available through HSH Intelligence. Before purchasing any dataset, buyers can review the full field definitions, data types, coverage rates, and example values for every data product we offer. Our goal is complete transparency — you know exactly what you are buying before you spend a single dollar.

Purpose

HSH Intelligence believes that informed buyers are the best buyers. This Data Schema exists to eliminate guesswork, reduce back and forth with our sales team, and give technical buyers, data engineers, and procurement teams everything they need to evaluate our datasets independently. Every field is documented. Every coverage rate is honest. Every example is real.

AI Training Intelligence Schema

Our AI Training Intelligence datasets contain production grade code files, real world financial filings, and millions of structured instruction response pairs. Fields include source repository or filing identifier, content type classification, domain category, instruction prompt text, response text, token count, quality score, language classification, and collection timestamp. All records are delivered in clean JSON or CSV format compatible with all major fine tuning frameworks including Hugging Face, OpenAI fine tuning API, and custom training pipelines.

Startup and Founder Intelligence Schema

Our Startup and Founder Intelligence datasets contain comprehensive profiles on companies that have passed through the world's most elite startup programs. Fields include company name, official website, industry classification, accelerator batch and cohort year, company description, one liner summary, founder full names, verified founder email addresses, headquarters location, team size, current hiring status, company stage, top company flag, nonprofit flag, accelerator profile URL, funding amount, funding round, funding source, funding confidence score, and collection timestamp. Email coverage varies by batch and ranges from 25 to 65 percent across the full dataset.

Decision Maker Intelligence Schema

Our Decision Maker Intelligence datasets contain verified profiles on technology leaders and senior executives actively evaluated for software purchasing decisions. Fields include full name, job title, company name, company website, professional profile URL, verified business email address, phone number where available, technology stack in use at their organization, buyer intent signals and categories, intent signal strength score, contract value estimate, geographic location, company size, industry classification, and data confidence score. Intent signal coverage is present on all records in this dataset.

E Commerce Intelligence Schema

Our E Commerce Intelligence datasets contain detailed profiles on active online merchants and store operators. Fields include store name, store URL, platform classification, owner full name, verified contact email, phone number where available, estimated monthly revenue range, primary product category, secondary product categories, store location, technology stack detected, social media presence, team size estimate, and collection timestamp. Revenue estimates are derived from public signals including traffic data, product volume, and platform indicators.

B2B Contact Intelligence Schema

Our B2B Contact Intelligence datasets represent our largest and most comprehensive data asset. Fields include company domain, company name, verified business email addresses, phone numbers where available, LinkedIn company profile, Twitter handle, technology stack detected on company website, company status, funding mentions, investor names, intent signals, estimated contract value, agency flag, state, country, data confidence score, data tier classification, enrichment score, and collection timestamp. This dataset is continuously growing and re enriched on an ongoing basis.

Data Delivery Formats

All datasets are available for delivery in CSV format compatible with Excel, Google Sheets, and all major CRM platforms. JSON format is available for all datasets and is recommended for developers and data engineering teams. API access is available for select datasets and provides real time query capability with field level filtering. Custom delivery formats and scheduled data refreshes are available for enterprise buyers on request.

Coverage and Freshness

HSH Intelligence datasets are continuously updated as new public data becomes available. Collection timestamps are included on every record so buyers can assess data freshness independently. We do not sell static one time snapshots. Our data is a living, growing intelligence asset that improves in coverage and accuracy over time. Enterprise buyers can request custom refresh schedules and incremental update feeds.

Contact Us

Healing Sun Haven LLC Data Division: HSH Intelligence 15442 Ventura Blvd, Suite 201-1914 Sherman Oaks, CA 91403 Email: info@healingsunhaven.com Website: www.healingsunhaven.com Data Division: www.hshintelligence.com

Your competitors are already using better data. Are you?

AI marketing  automation for data driven strategies.
Map Image

London, UK

Singapore

HQ: Sherman Oaks, CA

Your competitors are already using better data. Are you?

AI marketing  automation for data driven strategies.
Map Image

London, UK

Singapore

HQ: Sherman Oaks, CA

Your competitors are already using better data. Are you?

AI marketing  automation for data driven strategies.
Map Image

London, UK

Singapore

HQ: Sherman Oaks, CA