Enterprise level, full stack web scraping suite for business
4.7/5 (28 avaliações)Scrapinghub is a provider of full stack web scraping solutions, offering a suite of platform products for transforming websites into data at scale. Combining technology and services to meet the needs of business users and developers from entry to Enterprise-level, Scrapinghub's custom on-demand data extraction plans are backed by qualified web scraping engineers. In addition, its Crawlera product promises "smart" IP proxy management that operates via HTTP request sends to its API, providing automatic proxy rotation and ban management. Available across scalable pricing plans designed for small to Enterprise-level scraping based on monthly and concurrent requests; Crawlera offers automatic ban detection, user behavior simulation and unlimited bandwidth. Also supported by headless browsers, Scrapinghub's own Splash browser is a purpose-built solution available as an open source project or hosted SaaS product. This lightweight headless browser scrapes data from websites at scale, including JavaScript-generated content, while simulating user behavior via custom scripting.
Scrapy Cloud is the provider's cloud-based platform for managing and automating the deployment of web crawlers or "spiders" via a real time dashboard interface. Built-in QA tools and integration of Scrapinghub’s own Spidermon framework join intelligent scheduling options and customizable containers. Lastly, Scrapinghub's automatic data extraction API (beta) provides eCommerce and article data extraction at scale. The Select Product API returns structured data such as product information from eCommerce URLs, while the Select Article API scrapes news articles and blog posts.
Vantagens
It really saves my time as a web scraping specialists. I always need rotated proxies for my client needs. I keep creating a new project and it is a lot of hassle of I have to buy proxies, renew proxies, and create proxy rotation on my own. I am glad I don't even need to think about it. It just works!
Desvantagens
Well, it's just doesn't have enough advanced documentation on Scrapy Splash combined with Crawlera. Their development team in Github is very responsive though.
Our company, whatoplay.com, aggregates data from multiple sources around the world. Since we started using Crawlera, we haven't really encountered big issues. Our common ones in the last 6 months is just reaching our limit, so we just had to increase our plan.
Vantagens
We're only using Crawlera for our data scraping needs and it has worked seamlessly since we started using it in 2016. When we had to upgrade to a higher limit, the transition was fast, and no dev time was wasted updating our existing codebase.
Desvantagens
Pricing can be a bit tricky especially if your need is right in between the plans.
I have a good experience with scrappinhumb, in a matter of minutes I can develop and publish a new spider.
Vantagens
The software delivers the exactly promised result. It has a friendly interface and an efficient support.
Desvantagens
You need a roadmap of new features and the pricing of third-party services.
Thank you for the review. All new features will be announced in our Support Center.(https://support.scrapinghub.com/support/home) We also have an Ideas Forum (https://support.scrapinghub.com/support/discussions/forums/22000200101) where we would love your input on ideas or new features you would like to see. We appreciate all input from our Customers to continue to help us improve our products.
I use and recommend that platform for years for my customers which need production-ready enterprise-grade data scraping systems.
Vantagens
- Original and flexible technology
- No vendor-lock
- Easy to use for professionals
- Pretty convenient API system for integration with third-party solutions
Desvantagens
- Not so easy to use by non-professional IT person that still wants to use data scraping
- Lack of ability to create some kind of simple and clear user interface for such a persons
- No simple solution for distributed/high-volume crawling
- Lack of monitoring and alerting, non convenient logging system
- Overpriced Crawlera
Thank you for your review we appreciate all customer feedback. We do have a managed service offering for non IT professional who want access to the Data without having any scraping skills.
Started using it 2 years ago, learning curve totally rely on the framework because the cloud platform is pretty intuitive, self explanatory and easy to use.
Whenever I needed support I have been response fast and with a favorable solution.
Vantagens
A full integrated platform for a framework well done for it purpose. Easy to use, based on others frameworks structure, so if you are used to web development in python then it's a piece of cake to create spiders.
The integration (scrapy + scrapinghub) its really good, from a simple deployment through a library or a docker makes it suitable for any need.
Good support and constant improvement on the platform.
A lot of plugins and, open to any feature needed.
Desvantagens
So far there is nothing I dislike about it.
Data on demand:
Once-off: From $500 per site
Data Subscription: $250 per month
Custom: From $2,000 annually
Scrapy Cloud
Starter: FREE
Professional: $9 per unit, per month
Splash
Small: $25 per month
Medium: $50 per month
Large: $100 per month
Enterprise: Custom
Crawlera
C10 Plan: $25 per month
C50 Plan: $100 per month
C100 Plan: $250 per month
C200 Plan: $500 per month
Enterprise: From $1,000 per month
• Scrapinghub is a full stack web scraping platform for business startups and Enterprises, comprising a suite of developer tools alongside managed services for data on demand applications.
• Custom data extraction services are also available, supported by a team of over 100 qualified web scraping engineers.
• Crawlera is a proxy network designed for scalable web scraping based on monthly requests, with main features including automatic ban detection and IP rotation, persistent sessions, headless browser support, plus unlimited bandwidth.
• Splash is Scrapinghub's lightweight headless browser that facilitates the scraping of JavaScript generated content and the simulation of user behavior with custom scripts.
• Scrapy Cloud offers cloud-based management of the deployment and running of web crawlers, comprising a real time dashboard, built-in QA tools including spider monitoring, scheduling, customizable containers etc.
Abaixo estão algumas perguntas frequentes sobre o Scrapinghub.
O Scrapinghub oferece os seguintes planos de pagamento:
A partir de: US$ 9,00/mês
Modelo de preços: Gratuito, Código aberto, Assinatura
Avaliação gratuita: Disponível
Data on demand:
Once-off: From $500 per site
Data Subscription: $250 per month
Custom: From $2,000 annually
Scrapy Cloud
Starter: FREE
Professional: $9 per unit, per month
Splash
Small: $25 per month
Medium: $50 per month
Large: $100 per month
Enterprise: Custom
Crawlera
C10 Plan: $25 per month
C50 Plan: $100 per month
C100 Plan: $250 per month
C200 Plan: $500 per month
Enterprise: From $1,000 per month
O Scrapinghub oferece os seguintes recursos:
Os clientes habituais do Scrapinghub são:
Grandes empresas, Empresas de médio porte, Pequenas empresas
O Scrapinghub está nos seguintes idiomas:
Inglês
O Scrapinghub tem os seguintes planos de preços:
Gratuito, Código aberto, Assinatura
Não temos informações sobre os dispositivos compatíveis com o Scrapinghub.
O Scrapinghub se integra com os seguintes aplicativos:
Dropbox, GitHub, Google Drive
O Scrapinghub oferece as seguintes opções de suporte:
Suporte online, Suporte por telefone
It makes me work really fast but still secure for my clients. I look really professional even though what I did was only suggesting and installing Crawlera. It's awesome!