LLC Filing Data: What It Contains and How to Use It

May 7, 2026 | 6 min read

LLC Filing Data Is Public — Here's Why Most People Use It Wrong

Every state publishes LLC formation records. They're public filings, free to access, and updated continuously. That's not a secret. What most people miss is that the raw data is nearly useless without a classification layer — and most vendors selling "LLC data" are still selling you a static export from a database that was fresh six months ago.

I built AlphaLeads because I couldn't find anyone doing LLC filing data the way I thought it should be done: pull it daily, classify it by industry, enrich what you can, and deliver it fresh. This post is about what that actually involves — including the parts that don't work cleanly.

What LLC Filing Data Actually Contains

When a new LLC files with a state secretary of state, the public record typically includes the company name, registered agent, filing date, and sometimes a principal address. That's roughly it. There's no industry code, no SIC classification, no employee count, no revenue estimate. Just a name and a date.

The company name is doing a lot of work in that list. "Sunrise Services LLC" tells you almost nothing. "Golden State Roofing and Gutters LLC" tells you quite a bit. The variation in how founders name their companies ranges from completely opaque to almost offensively descriptive — and that variation is the core challenge in making LLC filing data useful for outreach.

A lot of lead data vendors handle this by appending industry codes manually after the fact, or by matching against established business directories. The problem: those matches only exist for businesses that are already in a directory somewhere. A brand-new LLC formed last Tuesday isn't in any directory yet. That's the whole point of using formation data — these are businesses that don't exist in any database yet.

How We Actually Classify New Filings

We pull new LLC registrations daily from secretary-of-state sources in 8 states — currently Florida, Texas, New York, California, Ohio, Georgia, North Carolina, and Colorado. That runs to roughly 4,000 new filings per day across those states.

We pass each filing through Claude Haiku for classification. I chose Haiku specifically because we're processing thousands of records daily and the cost-per-classification has to be small enough that the whole operation stays economically viable at our price points. Haiku reads the business name, sometimes the registered agent name or address, and outputs a best-guess industry classification. It handles the boilerplate of LLC naming reasonably well — "Consulting," "Services," "Solutions" appended to a person's name triggers different logic than "Plumbing and HVAC" appended to a city name.

The classification is a best guess. I want to be direct about that. When a filing says "MK Ventures LLC," there is no model on earth that can tell you with confidence whether that's a real estate investor, a marketing agency, or someone's side hustle reselling on Etsy. We flag low-confidence classifications so the customer knows which rows need human review. That's better than false confidence, but it means not every row in the daily list is equally useful.

The Gap Between "Fresh" and "Actionable"

Filing date freshness is necessary but not sufficient. A lead filed yesterday is fresh. Whether it's actionable depends on whether you can reach anyone at the company — and that's where LLC filing data gets genuinely hard.

Newly formed LLCs often don't have a website yet. They might not have a business email. The phone number, if there is one, might be a personal cell. We do contact enrichment on a best-effort basis — pulling any available business email or phone data that's surfaced in public records or other open sources — but the enrichment hit rate on brand-new filings is materially lower than it would be for a two-year-old business. There's just less data available.

Some vendors paper over this with padding: they'll sell you "new business leads" but inflate the list with companies that are 12–18 months old because those have better contact data. I don't do that. If you want AlphaLeads, you're getting filings from the last 30 days or less. If you need a company to already have a functioning website and a verified phone number, you probably want Apollo or Lead411 for their existing business database — not a new-formation product.

Why Eight States and Not All Fifty

The most common question I get from people evaluating AlphaLeads is some version of: "Why only 8 states? My territory is [state not on the list]."

It's a fair question. The honest answer is that secretary-of-state websites vary dramatically in how structured and accessible their data is. Some states have clean, well-structured filing feeds. Others require navigating interfaces that weren't designed for automated access, or have data quality issues that make classification much less reliable. I'd rather cover 8 states well than 50 states poorly.

The 8 states we do cover represent a significant share of total US LLC formation volume — Florida and Texas alone account for a disproportionate percentage of new formations nationally. But if your core market is, say, Michigan or Arizona, AlphaLeads isn't the right fit for you right now. That's a real limitation and I'd rather say it plainly than have someone subscribe and be disappointed.

What Makes LLC Filing Data Useful for Cold Outreach

The actual outreach value in fresh LLC data comes from timing. A business that filed last Monday is deciding — right now — which accountant to use, which insurance broker to call, which agency to hire for their website or their ads. They don't have incumbent vendors yet. There's no contract to displace.

That window closes. By the time a new LLC is six weeks old, they've made most of those initial vendor decisions. By the time they show up in InfoUSA or similar compiled data, they've been in business for months and are no longer "new" in any useful sense for that initial-formation outreach strategy.

This is why the typical B2B cold outreach expectation — 1–2% reply rates on a cold email campaign — is actually beatable with new LLC data for specific use cases. It's not magic. It's just that the timing is legitimately better for certain offers: business banking, bookkeeping, business insurance, website builds, payroll setup. These are all things a new LLC founder is actively thinking about in their first 30 days that they'll never think about again in quite the same way.

For broader outreach — finding companies based on technographic signals, existing revenue estimates, employee count — LLC filing data is the wrong tool entirely. Use ZoomInfo or Apollo for that. Formation data is specifically useful for the "newly formed, not yet vendored" outreach motion.

A Note on Contact Enrichment Expectations

I've talked to people who expected to get a list of 500 new LLCs and find phone numbers and direct emails on all 500. That's not realistic for new formations. In early testing, enrichment rates on very fresh filings (under two weeks old) have been lower than we'd like. For filings between two and four weeks old, we see better coverage.

Part of what I'm still working on is improving the enrichment pipeline — adding more open-source data inputs, getting better at surfacing the founder's name from the registered agent or organizer fields, and cross-referencing that with social and public records. It's an ongoing build, not a finished product.

If you want to see what the current data looks like for your specific states or niches, the best thing to do is look at the pricing page on alphaleads.cc and reach out for a sample list. I'd rather you evaluate the actual data than take my word for it.

For more on how I built the daily pipeline behind this, the post "What I Learned Pulling 4,000 New LLC Filings Daily" goes deeper on the architectural decisions. And if you're curious about how filing timing affects lead quality specifically, "Why Most New Business Lead Lists Are Dead on Arrival" covers the freshness problem from a different angle.

Get Fresh Business Leads Daily

500+ new LLC filings delivered every morning, AI-classified by niche. Starting at $29/month.

View Plans