Thread 105964580 - /g/ [Archived: 1031 hours ago]

/wsg/ - Web Scraping General
7/20/2025, 8:43:02 AM No.105964580
1695719026754
1695719026754
md5: 7f12827f9f019905024af09a8fb29998🔍
Web Scraping General

FAQ: https://rentry.org/scrapists

> Captcha services
https://2captcha.com/
https://www.capsolver.com/
https://anti-captcha.com/

> Proxies
https://hproxy.com/ (no blacklist) (recommended, owned by friend of /wsg/)
https://infiniteproxies.com/ (no blacklist)
https://www.thunderproxies.com/
http://proxies.fo/ (not recommended)

> Network analysis
https://mitmproxy.org/
https://portswigger.net/burp

> Scraping tools
https://beautiful-soup-4.readthedocs.io/en/latest/
https://www.selenium.dev/documentation/
https://playwright.dev/docs/codegen
https://github.com/lwthiker/curl-impersonate
https://github.com/yifeikong/curl_cffi

> Cool projects by members of our community
doubledouble.top / lucida.to - Free music scraped from spotify
nekohouse.su - Kemonoparty for fanbox/fantia/subscribestar
tv.weboasis.app - Falcon, a goy invite-only pirate streaming service that scrapes video streams from multiple sources

asking for the new telegram edition (i lost my old tg account and the backup signal :/)
Replies: >>105964728 >>105965830
Anonymous
7/20/2025, 9:06:36 AM No.105964717
oops i messed up... oh well
Anonymous
7/20/2025, 9:08:15 AM No.105964728
>>105964580 (OP)
Come post with us on metachan
Replies: >>105965018
Anonymous
7/20/2025, 10:05:43 AM No.105965018
>>105964728
could i steal a tg link? (or signal <3)
tried finding heavens but i cant
can share pgp if you prefer
Replies: >>105965450
Anonymous
7/20/2025, 10:08:30 AM No.105965041
Subtle tranny colour scheme
Anonymous
7/20/2025, 10:19:23 AM No.105965100
There's nothing to scrape.
Anonymous
7/20/2025, 11:21:03 AM No.105965450
SPP4M
SPP4M
md5: 51f2e1b50cc4bbafd15045a691c86e77🔍
>>105965018
This general's tg is @scrapists according to the previous OP >>105917015. That's all I know.
Replies: >>105965698
Anonymous
7/20/2025, 12:06:24 PM No.105965698
>>105965450
its broken :/
Anonymous
7/20/2025, 12:32:17 PM No.105965830
>>105964580 (OP)
would recommned proxycheap as it can be used to scrape .gov domains (at least from my experience).
https://www.proxy-cheap.com/
Replies: >>105965853
Anonymous
7/20/2025, 12:34:58 PM No.105965853
1733033984164700
1733033984164700
md5: 07309ff7f47498ccd5cf700a92f7ba73🔍
>>105965830
What are you scraping on .gov space?
Replies: >>105966143
Anonymous
7/20/2025, 1:15:48 PM No.105966143
>>105965853
In my cuntry there is a government-backed search engine with no authentication that allows you to find someone's real name, address and some other PIE by just knowing the number of their residential record. I wrote a basic scraper that brute-forces those in multiple threads with different proxies. Thanks to that I have obtained names and addresses of all people in my local area who have some share in any regional estate. Sadly, they will soon make it so you need to go through KYC in order to use this website.
Replies: >>105967520
Anonymous
7/20/2025, 4:08:19 PM No.105967520
>>105966143
Scrap as much info as you can, maybe y ou can sell it to data brokers