cpeffer's comments

cpeffer · on May 1, 2024

If you’re looking for an open-core version of this, check out firecrawl.dev

cpeffer · on April 14, 2024

Very cool. We posted yesterday about a similar tool we built.

It also crawls (although you can scrape single pages as well).

cpeffer · on April 13, 2024

It crawls webpages (finds subdirectories), handles JS blocking with fallbacks to headless browsers, and does this all concurrently.

If only that script worked for every website. But, alas, it does not.
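
For anyone curious what "crawl concurrently, fall back to a headless browser when JS blocks you" can look like, here is a rough sketch. It is not the tool's actual implementation. The names `fetch_with_fallback`, `crawl_all`, `plain_fetch`, `browser_fetch` and `looks_blocked` are all made up for illustration, and the two fetchers are passed in as parameters so you can plug in whatever HTTP client or headless browser you use.

```python
import asyncio
from typing import Awaitable, Callable, Dict, Iterable

# Hypothetical type alias: any async function that takes a URL and returns HTML.
Fetcher = Callable[[str], Awaitable[str]]


async def fetch_with_fallback(
    url: str,
    plain_fetch: Fetcher,
    browser_fetch: Fetcher,
    looks_blocked: Callable[[str], bool],
) -> str:
    """Try the cheap fetch first; retry with the browser fetcher if it looks blocked."""
    html = await plain_fetch(url)
    if looks_blocked(html):
        html = await browser_fetch(url)
    return html


async def crawl_all(
    urls: Iterable[str],
    plain_fetch: Fetcher,
    browser_fetch: Fetcher,
    looks_blocked: Callable[[str], bool],
    limit: int = 5,
) -> Dict[str, str]:
    """Fetch all URLs concurrently, with at most `limit` requests in flight."""
    semaphore = asyncio.Semaphore(limit)

    async def worker(url: str):
        async with semaphore:
            html = await fetch_with_fallback(
                url, plain_fetch, browser_fetch, looks_blocked
            )
            return url, html

    pairs = await asyncio.gather(*(worker(u) for u in urls))
    return dict(pairs)
```

You would call it with `asyncio.run(crawl_all(urls, my_plain_fetch, my_browser_fetch, my_block_check))`. The semaphore is there so a big URL list does not open hundreds of connections at once.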

cpeffer · on April 13, 2024

* Creator here - That's the goal!

donohoe · on April 14, 2024

And do you honor or ignore robots.txt?

cpeffer · on April 14, 2024

It wasn't in our initial version (we didn't plan on launching today), but we are pushing an update to do so now.
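
For reference, a minimal sketch of a robots.txt check using Python's standard `urllib.robotparser`. This is only an illustration of the idea, not the update mentioned above. The helper names `build_parser` and `allowed`, the `example-bot` user agent and the sample rules are assumptions.

```python
from urllib.robotparser import RobotFileParser


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt content that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def allowed(parser: RobotFileParser, user_agent: str, url: str) -> bool:
    """Return True if the rules let this user agent fetch the URL."""
    return parser.can_fetch(user_agent, url)


# Hypothetical rules, for illustration only.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

parser = build_parser(SAMPLE_ROBOTS)
docs_ok = allowed(parser, "example-bot", "https://example.com/docs/")
private_ok = allowed(parser, "example-bot", "https://example.com/private/page")
```

A crawler would run a check like this before queuing each URL and skip anything that comes back `False`.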