veganism.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Veganism Social is a welcoming space on the internet for vegans to connect and engage with the broader decentralized social media community.

#webscraping

4 posts · 3 participants · 0 posts today

The smartest brands aren’t guessing anymore. They’re automating how they collect and act on web data at every step:

• Dynamic pricing that adapts to the market

• Spotting trends before competitors do

• Understanding what customers really think

It’s how data becomes a competitive edge.

👉 Explore how top DTC brands are using web scraping to fuel growth: bit.ly/3G60wdu

I'm having trouble figuring out what kind of botnet has been hammering our web servers over the past week. Requests come in from tens of thousands of addresses, just once or twice each (so they aren't getting blocked by fail2ban), with varying User-Agent strings (Chrome versions ranging from 24.0.1292.0 to 108.0.5163.147) and ridiculous cobbled-together paths like /about-us/1-2-3-to-the-zoo/the-tiny-seed/10-little-rubber-ducks/1-2-3-to-the-zoo/the-tiny-seed/the-nonsense-show/slowly-slowly-slowly-said-the-sloth/the-boastful-fisherman/the-boastful-fisherman/brown-bear-brown-bear-what-do-you-see/the-boastful-fisherman/brown-bear-brown-bear-what-do-you-see/brown-bear-brown-bear-what-do-you-see/pancakes-pancakes/pancakes-pancakes/the-tiny-seed/pancakes-pancakes/pancakes-pancakes/slowly-slowly-slowly-said-the-sloth/the-tiny-seed

(I just put together a bunch of Eric Carle titles as an example. The actual paths are pasted together from valid paths on our server but in invalid order, with as many as 32 subdirectories.)

Has anyone else been seeing this and do you have an idea what's behind it?
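For anyone wanting to check their own logs for the same pattern, here is a minimal sketch that flags it: IPs that appear only once or twice, requesting implausibly deep paths. It assumes a combined-format access log; the thresholds (`max_hits_per_ip`, `min_depth`) and the sample lines are made up for illustration.

```python
import re
from collections import Counter

# Matches the client IP and request path from a combined-log-format line.
LOG_RE = re.compile(r'^(\S+) \S+ \S+ \[[^\]]+\] "(?:GET|POST|HEAD) (\S+)')

def suspicious_hits(log_lines, max_hits_per_ip=2, min_depth=8):
    """Return (ip, path) pairs from rarely-seen IPs requesting very deep paths."""
    hits = []
    for line in log_lines:
        m = LOG_RE.match(line)
        if m:
            hits.append((m.group(1), m.group(2)))
    per_ip = Counter(ip for ip, _ in hits)
    return [
        (ip, path)
        for ip, path in hits
        # Only once-or-twice visitors, asking for paths with many segments.
        if per_ip[ip] <= max_hits_per_ip
        and path.strip("/").count("/") + 1 >= min_depth
    ]

sample = [
    '203.0.113.7 - - [01/Jan/2025:00:00:00 +0000] "GET /a/b/c/d/e/f/g/h/i/j HTTP/1.1" 404 0',
    '198.51.100.2 - - [01/Jan/2025:00:00:01 +0000] "GET /about-us HTTP/1.1" 200 512',
]
print(suspicious_hits(sample))  # flags only the deep-path request
```

Grouping the flagged paths by their constituent segments may also reveal that they are all permutations of the same small set of valid URLs.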

🧠 Doing market research or tracking prices online?

With ProxySocks5, you can run your scrapers 24/7 without worrying about blocks or limits. Our static and rotating proxies help you collect data smoothly from the locations you need—right down to the city.

Choose between HTTP, SOCKS5, Shadowsocks, Trojan, and WireGuard VPNs, available as datacenter or residential IPs. No data caps, no throttling, and no headaches.

🔍 proxysocks5.com

#WebScraping #ResidentialProxies #ProxySocks5 👽

This week I wrote about using Scrapy's CrawlSpider to follow links declaratively during a web scraping project.

However, in a past project I needed to extend this functionality a bit, defining rules dynamically (based on user input).

So, as a continuation of my previous post, I wrote a new one explaining how that solution was built.

rennerocha.com/posts/dynamic-r

Dynamic rules for following links declaratively with Scrapy | Renne Rocha