
I've used Scrapy a lot. Just my opinion:

1. Instead of creating a global urls variable, use the start_requests method.

2. Don't use BeautifulSoup for parsing; use CSS or XPath selectors.

3. If you are following links across many pages over and over again, use CrawlSpider with Rule.



Can you please give some details about your second point? What's wrong with BeautifulSoup?


Using CSS and XPath to select elements is very natural for web pages. BS4 has very limited CSS selector support and no XPath support at all.


It is very slow. But personally, I prefer to write my crawlers in Go (custom code, not Colly).


Try Parsel: https://github.com/scrapy/parsel

It's way faster and has better support for CSS selectors.


> But personally, I prefer to write my crawlers in Go (custom code, not Colly).

This is my current setup as well; I've been scraping on and off for 20+ years now.


What's your problem with Colly? [0]

[0] http://go-colly.org/


Mostly that I started my crawler before learning about Colly and it didn't make sense to rewrite the code.

By "not Colly" I just wanted to remark that in Go it is relatively easy to write a crawler from scratch.



