Tried Scraping, AI Agents, and APIs… None Worked. So I Built a Chrome Extension.

Leader posted 3 min read

I Disappeared for 15 Days… Because I Was Building This Chrome Extension

For the past couple of weeks, I completely stopped posting about LeadIt.

Not because I quit.
Not because the project died.

But because I hit one of the most frustrating problems I’ve faced while building a product.

Getting good lead data.


The Problem I Hit While Building LeadIt

LeadIt’s UI and UX were already finished.

But the real challenge wasn’t the product.

It was the data.

I needed things like:

  • company name
  • founder name
  • founder LinkedIn
  • company website
  • company size
  • location
  • YC batch

And the biggest pain?

Founder emails are hidden almost everywhere.

Platforms like Y Combinator don’t expose them easily.
So I decided to ignore emails for now and add them later.

But then another problem appeared.


Manual Data Collection Was Killing Me

To build the LeadIt database, I started collecting data manually from Y Combinator companies.

Open company page → copy data → paste → move to next.

Simple, right?

Wrong.

Even a single lead took around 5 minutes.

After doing this repeatedly, I started hating the process.

It was slow.
It was boring.
And honestly… it felt stupid.

There had to be a better way.


My First Attempt: Scraping

So I tried Playwright.

The idea was simple:

Automate the browser → scrape YC pages → collect the data.

But there was one big problem.

Platforms don’t like scrapers.

Pages started blocking the requests.

So scraping wasn’t going to scale.


My Second Attempt: AI Agents

Then I tried something more experimental.

I built an AI agent using LangChain and used Serper API as a tool for searching data.

The plan was:

AI agent finds companies → extracts founder info → structures the data.

In theory it sounded amazing.

In reality?

The data was terrible.

Instead of startup founders, it returned things like:

  • random big tech companies
  • irrelevant sites
  • incomplete information

The data quality was so bad that it was unusable.

So I stopped everything.


I Took a Break for Two Days

For two days, I didn’t touch the project.

I just kept thinking:

“How do tools actually collect data from websites without scraping or APIs?”

Because most platforms don’t give you APIs.

LinkedIn?
You need partnerships.

Apollo?
Paid access.

Hunter?
Paid.

Everything was locked.

Then suddenly the idea hit me.


The Idea: What If the Browser Reads the Page?

Instead of scraping servers…

What if the browser itself reads the page?

Right when the user opens it.

That’s when the idea of building a Chrome extension came.

The extension would:

  • read the page DOM
  • extract the data
  • structure it instantly

No scraping servers.
No blocked requests.

Just reading the page already loaded in the browser.

So I started building it.


Another Problem: Every Page Has Different CSS

When I started building the extension, I hit another challenge.

Each page had different CSS selectors.

So relying on simple selectors like .class-name wouldn’t work.

At first it looked messy.

But after spending a full day analyzing the YC page DOM, I discovered something interesting.

Even though CSS classes were different…

The underlying DOM structure patterns were consistent.

So instead of targeting CSS classes, I started targeting DOM structure patterns.

And suddenly…

It worked.


Introducing: Kallector

That’s how Kallector was born.

Kallector is a Chrome extension that reads startup data directly from the page DOM.

No scraping servers.
No API access needed.

Just open the page.

Kallector reads it.

And structures the data instantly.


Current Features

Right now Kallector can collect data from Y Combinator company pages, including:

  • Company name
  • Company website
  • Founder name
  • Founder LinkedIn
  • Company size
  • Location
  • YC batch

The extension reads the page DOM and shows the structured data inside its panel.


What Kallector Solves

Kallector turns 5 minutes of manual work into seconds.

Instead of copying data manually across multiple tabs…

You open the page and the data is already structured.

This is especially useful for:

  • startup research
  • building lead databases
  • founder outreach
  • sales prospecting

What’s Next

Right now Kallector only supports Y Combinator company pages.

But the plan is to expand to:

  • LinkedIn profiles
  • startup directories
  • other founder databases

Originally I built Kallector only to power the LeadIt database.

But now I’m starting to see it as something bigger.

It might become a separate SaaS product on its own.


If you’ve ever tried collecting startup data manually…

You know exactly why I built this.

And this is just the beginning.

2 Comments

1 vote
1
2 votes
1
0
2
1

More Posts

How I Built a React Portfolio in 7 Days That Landed ₹1.2L in Freelance Work

Dharanidharan - Feb 9

I’m a Senior Dev and I’ve Forgotten How to Think Without a Prompt

Karol Modelskiverified - Mar 19

Sovereign Intelligence: The Complete 25,000 Word Blueprint (Download)

Pocket Portfolio - Apr 1

Just completed another large-scale WordPress migration — and the client left this

saqib_devmorph - Apr 7

Why Email-Only Contact Forms Are Failing in 2026 (And What Developers Should Do Instead)

JayCode - Mar 2
chevron_left

Related Jobs

View all jobs →

Commenters (This Week)

2 comments
1 comment
1 comment

Contribute meaningful comments to climb the leaderboard and earn badges!