Proxy Market News - Proxyway https://proxyway.com/news Your Trusted Guide to All Things Proxy Fri, 26 Apr 2024 11:20:50 +0000 en-US hourly 1 https://wordpress.org/?v=6.5.2 https://proxyway.com/wp-content/uploads/2023/04/favicon-150x150.png Proxy Market News - Proxyway https://proxyway.com/news 32 32 Bright Data Equalizes Residential and Mobile Proxy Rates https://proxyway.com/news/bright-data-equalizes-residential-mobile-proxy-rates https://proxyway.com/news/bright-data-equalizes-residential-mobile-proxy-rates#respond Fri, 26 Apr 2024 11:19:16 +0000 https://proxyway.com/?post_type=news&p=21308 In essence, the provider’s mobile proxies just got 65% cheaper.

The post Bright Data Equalizes Residential and Mobile Proxy Rates appeared first on Proxyway.

]]>

News

In essence, the provider’s mobile proxies just got 65% cheaper. 

Adam Dubois

Bright Data, the Israeli web data platform, has revised the pricing of its mobile proxy network. 

The rates are now identical to Bright Data’s residential proxy product – and also 65% cheaper compared to their previous iteration.

 Pay as you go$500$1,000$2,000
New price/GB$8.40$7.14$6.30$5.88
Old price/GB$24$20.40$18$16.80
Difference-65%-65%-65%-65%

Following the change, Bright Data has become an extremely affordable option for mobile proxies, beating most premium peers and competing neck and neck with mid-market providers. 

bright data mobile proxy price comparison

After the second price decrease within a year, Bright Data’s mobile IP network now costs one fifth of the price compared to this time in 2023. 

The move shows how ruthlessly proxy providers have been cutting costs in an effort to push out their competition. We also interpret this as a signal that peer-to-peer mobile proxies are losing their premium aura: SOAX chose the same pricing strategy last August, and it’s also been adopted by multiple minor providers like AnyIP and ASocks.

Get proxy news and updates directly to your inbox.



The post Bright Data Equalizes Residential and Mobile Proxy Rates appeared first on Proxyway.

]]>
https://proxyway.com/news/bright-data-equalizes-residential-mobile-proxy-rates/feed 0
Rayobyte Publishes a Whitepaper on Preventing Proxy Abuse https://proxyway.com/news/rayobyte-abuse-prevention-whitepaper https://proxyway.com/news/rayobyte-abuse-prevention-whitepaper#respond Fri, 26 Apr 2024 11:05:24 +0000 https://proxyway.com/?post_type=news&p=21302 The document describes Rayobyte’s three-pronged approach against bad actors.

The post Rayobyte Publishes a Whitepaper on Preventing Proxy Abuse appeared first on Proxyway.

]]>

News

The document describes Rayobyte’s three-pronged approach against bad actors.

Adam Dubois

Rayobyte, the US-based provider of proxies and web scraping tools, has published a whitepaper that lists the measures it takes to prevent malicious proxy use. 

The paper introduces Rayobyte’s three-pronged approach of prevention, detection, and response. The implementation differs depending on the proxy origin, whether the IPs come from data centers or residential users. 

  1. Prevention involves measures like port blocking and rate limiting for datacenter proxies. The residential proxy network requires additional safeguards due to its more anonymous nature, such as a KYC procedure and a blacklist.
  2. Detection comprises automated monitoring for potential DDoS activities, unauthorized access to graylist websites, and usage that fails to match the customer’s stated purpose. It also involves manual detection by investigating abuse emails and tracking spam databases.
  3. Response actions take place following a complaint – Rayobyte concedes that the step is mostly relevant for datacenter proxies which have a clear business entity assigned to them. In this case, that entity is often Rayobyte. 


All in all, the crux of fighting abuse with residential proxies lies in prevention, while datacenter proxy servers, which are presented as a less attractive tool for misuse, rely more on reactive measures. 

We’re happy to see that proxy server vendors are starting to take ethical proxy use with increasing importance. Despite a few slip-ups throughout the years, Rayobyte has been one of the more vocal voices in the field. 

You can read the whitepaper here.

Get proxy news and updates directly to your inbox.

The post Rayobyte Publishes a Whitepaper on Preventing Proxy Abuse appeared first on Proxyway.

]]>
https://proxyway.com/news/rayobyte-abuse-prevention-whitepaper/feed 0
ScrapeCon 2024: A Recap https://proxyway.com/news/scrapecon-2024-recap https://proxyway.com/news/scrapecon-2024-recap#respond Thu, 11 Apr 2024 06:57:57 +0000 https://proxyway.com/?post_type=news&p=20824 Our impressions from Bright Data’s first virtual conference on web scraping.

The post ScrapeCon 2024: A Recap appeared first on Proxyway.

]]>

News

Our impressions from Bright Data’s first virtual conference on web scraping. 

Adam Dubois
scrapecon recap illustration

Bright Data’s web scraping conference, ScrapeCon, took place on April 2. It’s become our tradition to recap major industry events – let this be no exception. The article shares our impressions and provides brief (and often opinionated) summaries of the conference’s talks. If any catches your eye, Bright Data has made the videos available on demand

I find ScrapeCon particularly interesting, as it’s Bright Data’s first take at conferences (that we know of). Given how good Zyte’s Extract Summit and Oxylabs’ OxyCon have become over the years, expectations were high. 

Was the host able to meet them? Well, it depends. From the technical side – absolutely. The event felt very well organized and really had no technical issues to speak of. At the same time, there was a lot of product promotion and less interactiveness than we expected.

Organizational Matters

ScrapeCon was initially planned to take place on November 7, 2023. Unfortunately, it had to be delayed due to events in Israel. No news was given for a while, and then Bright Data announced a new date some time in March. 

Registration was free, which is nice and, by now, an industry standard for online events. You had to fill out a form with basic details like name, surname, and email. The application also asked to select which sessions you expected to attend. (Was it to gauge interest? Were you unable to watch the presentations you didn’t choose? I ticked all the boxes, so I couldn’t test what happened in that scenario.) Then, ScrapeCon’s team would verify your application and manually approve (or deny?) entry. 

I believe Bright Data used a platform called Cvent to host the conference. The hub was well-made, with notifications and chat boxes for Q&A. Unlike Extract Summit or OxyCon, each session had a different page with a description and Vimeo player. Some talks took around 15 minutes, so you had to jump in and out constantly. 

scrapecon conference hub
The online platform had many bells and whistles.

Bright Data opted to host the conference between 3PM and 7PM CEST. For us – and quite a few businesses in Europe – the time was unusual, but it was likely chosen to include both European and American audiences. All sessions were on the dot. The schedule had no formal breaks, but you’d typically get between five and ten minutes before the next talk. 

One feeling I couldn’t shake throughout the event was that I acted much more as a viewer rather than an active participant. Sure, you were free to ask questions. But all answers were given in text, and speakers never actually interacted with the audience directly. Everything might as well have been pre-recorded, and you wouldn’t have missed much.

The Talks

Opening Speech: The State of Public Web Data

Or Lenchner kicked off the conference with Bright Data’s narrative of the status quo. It pepped up the event and gave a good idea of what to expect. 

In brief, the demand for data is growing exponentially, and like drinking water, it’s poised to shape the future of humanity. AI is the next big revolution, and naturally, it requires data to train. Big corporations are gating off, but Bright Data secured some important victories (mostly notably against Meta), and regulation is the next frontier. 

Or mentioned something called the scraping cloud – I assume this is how Bright Data will position itself from this point on. He also teased Bright Shield, a two-pronged product currently in beta:

  • Bright Data’s clients can use it to track and enforce their web scraping policies, while
  • Webmasters are able to see which parts of their properties are scraped and – to an extent – influence data collection activities on those domains.
scrapecon intro
Data is being generated at an exponential rate.

Talk 1: Cloud-Native Scraping Made Simple

In the first presentation, Itzhak Yosef Friedman, Head of R&D at Bright Data, spoke about serverless web scraping infrastructure. More precisely, he introduced Bright Data Functions, previously called Web Scraper IDE, previously called Data Collector. 

In brief, Functions is a cloud-based development environment accessible through Bright Data’s dashboard. It includes pre-built code functions, templates for popular websites, and uses the provider’s unblocking infrastructure in the backend. There are other neat features built in, such as tools for debugging and monitoring. If needed, Functions can run headless browsers, also in the cloud. 

Itzhak then quickly built an interactive scraper to collect data from Lazada to show off the product. I found it nice that Functions separates the scraping and parsing stages and caches pages, so you can adjust the parser without reloading the URL. This was the first product advertisement of the day.

scrapecon talk 1
Functions, the web scraping IDE, in a nutshell.

A few sidenotes. It’s interesting to see how Functions evolved. The initial iteration, Data Collector, was actually marketed as a no-code tool: the templates had a UI layer, and there was an extension for visually building scrapers . Bright Data then pivoted toward developers, leaving the no code part to datasets. 

This was also the first presentation to feature Bright Data’s diagram of the modern web pipeline. It would appear time and time again throughout the conference.

scrapecon data pipeline
The first of the many times we'll see this slide.

Talk 2: Decoding Scraping Strategies: Build, Buy, or API?

Nir Borenshtein, Bright Data’s COO, spoke about the considerations of building a scraper in-house versus outsourcing parts of the process. The presentation was once again very much focused on Bright Data’s tooling, but in a more general, here’s what our platform (scraping cloud) looks like, kind of way. 

Nir began by elaborating on the web data pipeline we saw in the previous session, speaking about the challenges that arose throughout the years and how they led to the creation of Web Unlocker. He then characterized three models for data collection: in-house, hybrid, and data-as-a-service. Afterwards, Nir illustrated his points with several case studies and finally presented Bright Data platform’s layers.

Overall, it was a decent high-level overview, perhaps more from a business executive’s point of view. I’m not sure why this talk didn’t go after the intro – maybe that was the original design, but it got rearranged somewhere in the process.

scrapecon talk 2
Whichever path you take, Bright Data has something to offer.

Panel 1: The Future of Data for AI: Balancing Legal and Operational Challenges

Bright Data’s CEO Or Lenchner, Jo Levy from Norton Law Firm, and Anthony Goldbloom, Co-founder of Kaggle, discussed the role of web data in AI, the challenges collecting this data raises, and how to navigate them in the day-to-day. Jennifer Burns from Bright Data moderated the discussion.

Being accustomed to specialist panels, I found this intersection interesting, but it turned out to provide useful insights. Anthony, whose company uses web data to fine-tune LLMs, outlined a framework for data collection. It includes knowing your sources, depth of coverage, and the possible legal implications. 

Or raised thought-provoking points about the inherent bias in selecting datasets for LLMs and the increasing importance of data freshness. Jo’s most valuable argument, in my opinion, was that lenses like copyright or bias should be addressed in the curation rather than the selection stage. Other topics were touched upon as well, such as self-regulation. 

All in all, I enjoyed the discussion and recommend watching it. I also found it amusing how self-effacing Bright Data appeared compared to the other sessions: Or suggested that their services may not be necessary at first, and Anthony admitted using multiple vendors to mitigate risk.

scrapecon panel 1
The panelists never appeared all on one screen, so we work with what we have.

Talk 3: From AI-Powered Insights to Training LLMs

Lior Levhar from Bright Data ran through the best practices for creating datasets for large language models. The target audience was probably business types looking to get into LLM training, as most advice turned out pretty basic: tailor the dataset for your use case, don’t make it too big or small, remove duplicates, and validate unreliable sources like social media.

Throughout the session, Lior subtly referred to Bright Data’s datasets several times and whipped up a practical demonstration with Snowflake as the analytics tool. Although the session has limited value for data professionals, it’s a decent introduction for others.

scrapecon talk 3
The size of your database has to be just right.

Talk 4: A Blueprint for Building a Reliable Dataset

Itamar Amramovich from Bright Data continued the topic of datasets, diving deeper into how they’re crafted at his company. The presentation was heavily sales-driven, leaning on marketing lingo and rhetorical techniques. 

In brief, Itamar went through what he called the pillars of data reliability. The first, data collection, relies on proxies and unblocking infrastructure and applies various strategies, such as discovering sources through search engines and scraping internal APIs. Schema design requires defining core elements and cleaning raw output, while data validation looks into fill rate, uniqueness, min-max thresholds, and other metrics.

Marketing aside, the talk delivered on its promise of providing a blueprint, all the while letting us gleam into Bright Data’s internal tooling, such as its universal schema for datasets.

scrapecon talk 4
The pillars of data reliability.

Panel 2: The Executive Playbook

The second panel featured Ernesto Cohnen from Ixigo, Michael Beygelman from Claro Analytics, and Ganesh Kumar from Rakuten. It was hosted by Bright Data’s CCO Omri Orgad. The discussion’s professed aim was to discuss the best practices for operational and financial aspects of real-time data collection. 

In reality, it touched upon many topics, including but very much not limited to LLMs and their impact on business operations. The participants brought expertise from their respective fields, sharing individual challenges that sometimes intersected around the topics of multimodality of data, the growing importance of real-time decisions, and the need to discern data originators from regurgitators. 

To give you some examples, Ernesto encountered that around 10% of new Google Images are now AI-generated, which is a problem for a travel portal. Ganesh has found it crucial to understand the references that customers use to shop for products, and sometimes, you need to search in unlikely places (such as ticketing websites). And Michael emphasized that we live in an era of just-in-time where businesses require prescriptive data.

Ernesto concluded with a beautiful thought that the value of data lies in the stories you build with it. All in all, I found the panel to be one of the highlights of the conference.

scrapecon panel 2
One again, the panelists never appeared all at once during the talk, so let’s look at their beautiful profile pictures instead.

Talk 5: From Clicks to Captures: Mastering Browser Interactions for Scrapers

Aviv Besinsky and Ilya Kolker from Bright Data introduced the challenges of scraping dynamic websites and their approach to solving them. Some of the roadblocks include maintaining headless browser infrastructure, avoiding blocks, and parsing rendered content. Hard, right?

Well, you can always use Scraping Browser to overcome them. It scales automatically and has unblocking built in. Aviv provided a case study where a customer saved $5,000 by switching, and Ilya proceeded to demonstrate a scenario where he programmed a scraper to visit Amazon, type in Laptop, and download the page. All in all, it was a product pitch if you’re in the market for it.

scrapecon talk 5
Dynamic scraping induces several layers of headaches – but they’re nothing that can’t be solved with good tools (insert affiliate link here).

Talk 6: Beyond IP Bans & CAPTCHAs

Another presentation by the Aviv Besinsky and Ilya Kolker duo. This time, they discussed the challenges stemming from website protection against web scraping. These range from simple IP blocks to sophisticated behavior tracking.

Aviv first showed a timeline with the evolution of anti-bot technologies. He then overviewed the main anti-scraping methods in order of difficulty and provided some ways to overcome them. The guidelines were on a pretty high level (e.g. emulate real user interactions), which serves more as a starting point for further research than directly applicable advice.

In the second part, Ilya demoed how datacenter and residential proxies fare in two scenarios: accessing Amazon and G2. He used a basic scraper without and then with headers. After both failed with G2, Ilya switched to Bright Data’s Web Unlocker (which runs headless browsers and solves CAPTCHAs if needed) and successfully scraped the page.

scrapecon talk 6
I was about to write an angry letter about the lack of memes, but Aviv turned it around.

Panel 3: From Initial Request to Final Analysis

A panel with five tech influencers: Tech with Tim, Coding with Lewis, TiffinTech, Python Simplified, and Tech Bible. I expected a discussion format, but it looked like the participants were given a list of questions and recorded their answers individually. 

In any case, they covered a variety of topics, ranging from the very basics like what is web data? to the challenges developers face while scraping and their favorite tech stacks. I wouldn’t consider this panel crucial to watch, but it was entertaining and ended the conference on a lighter note.

scrapecon panel 3
The topic is broader than the shoulders of an olympic swimmer.

Conclusion

That was ScrapeCon. Despite being heavier on sales than we’re used to, I believe it was a success. Congratulations to Bright Data on their first online conference! 

If you’ve reached this far and would like to read up on other major industry events, here’re our recaps of 2023’s Extract Summit and OxyCon.

Get proxy news and updates directly to your inbox.

The post ScrapeCon 2024: A Recap appeared first on Proxyway.

]]>
https://proxyway.com/news/scrapecon-2024-recap/feed 0
Infatica Makes Its Residential Proxies Up to 43% Cheaper https://proxyway.com/news/infatica-residential-price-decrease-april-2024 https://proxyway.com/news/infatica-residential-price-decrease-april-2024#respond Tue, 09 Apr 2024 11:37:56 +0000 https://proxyway.com/?post_type=news&p=20805 The provider’s plans now give significantly more traffic for the same price.

The post Infatica Makes Its Residential Proxies Up to 43% Cheaper appeared first on Proxyway.

]]>

News

The provider’s plans now give significantly more traffic for the same price. 

Adam Dubois

Infatica, the Singapore-based provider of proxies and web scraping tools, has revised the rates of its residential proxy traffic.

Though the pricing of its plans hasn’t changed, Infatica’s plans now grant significantly more traffic. As a result, they’ve become up to 43% cheaper.

 $96$360$700$2,400$3,500
New traffic14 GB60 GB140 GB600 GB1,200 GB
Old traffic8 GB40 GB100 GB400 GB1,000 GB
New price/GB$6.86$6$5$4$2.92
Old price/GB$12$9$7$6$3.5
Difference-43%-33%-28%-33%-17%

The revision places Infatica among mid-market providers in terms of price, whereas it used to position as a premium company.

infatica residential proxy price comparison graph april 2024

This is Infatica’s first price update in a while. It was much needed, as some key competitors like Smartproxy, Bright Data, and Oxylabs have gone through as many as two price cuts in the meantime, the second taking place this March.

Get proxy news and updates directly to your inbox.

The post Infatica Makes Its Residential Proxies Up to 43% Cheaper appeared first on Proxyway.

]]>
https://proxyway.com/news/infatica-residential-price-decrease-april-2024/feed 0
Rayobyte Decreases Residential Proxy Prices, Moves to Subscription https://proxyway.com/news/rayobyte-residential-price-cut-subscription https://proxyway.com/news/rayobyte-residential-price-cut-subscription#respond Fri, 29 Mar 2024 13:01:55 +0000 https://proxyway.com/?post_type=news&p=20694 The new rates are up to 50% cheaper in lower ranges.

The post Rayobyte Decreases Residential Proxy Prices, Moves to Subscription appeared first on Proxyway.

]]>

News

The new rates are up to 50% cheaper in lower ranges. 

Adam Dubois

Rayobyte, the US-based proxy server provider, has revised the pricing model and rates of its residential proxy network. 

First, the provider has switched to subscription-based pricing. While there remains a pay-as-you-go plan without traffic caps, it no longer scales with volume.

Second, Rayobyte has reviewed its rates. Compared to the previous version without discounts, the prices are now up to 50% cheaper. The difference is starkest in lower ranges; at 1 TB, the largest public plan, Rayobyte’s proxies have actually received a minor price increase.

 Pay as you go15 GB50 GB100 GB250 GB500 GB1 TB
New price/GB

$7.5

$6.67

$5

$4.5

$4

$3.5

$3.2

Old price/GB

$15

$12.5

$7

$6

$5

$4

$3

Difference

-50%

-47%

-30%

-25%

-20%

-13%

+7%

After the revision, Rayobyte’s rates are in line with other mid-range residential proxy services. They’re especially competitive between 50 and 250 GB of traffic:

rayobyte residential proxy pricing graph

This is the fourth price decrease this year, following SmartproxyBright Data, and Oxylabs. And it’s likely not the last. We’re running out of synonyms for the word decrease, but that’s a good problem to have in such a context.

Get proxy news and updates directly to your inbox.

The post Rayobyte Decreases Residential Proxy Prices, Moves to Subscription appeared first on Proxyway.

]]>
https://proxyway.com/news/rayobyte-residential-price-cut-subscription/feed 0
Bright Data’s ScrapeCon to Take Place on April 2 https://proxyway.com/news/bright-data-scrapecon-2024 https://proxyway.com/news/bright-data-scrapecon-2024#respond Tue, 26 Mar 2024 07:54:28 +0000 https://proxyway.com/?post_type=news&p=20565 The rescheduled web scraping event will proceed in a week.

The post Bright Data’s ScrapeCon to Take Place on April 2 appeared first on Proxyway.

]]>

News

The rescheduled web scraping event will proceed in a week. 

Adam Dubois
scrapecon promotional image

Bright Data, the Israeli web data platform, has announced the date for ScrapeCon, its virtual web scraping conference. 

The event will take place on April 2, or a week from now. Its allotted time is from 3PM to 7PM CEST. 

These four hours will fit in 10 sessions covering four main topics:

  1. Cloud-based web scraping,
  2. Data for AI,
  3. Scaling scraping, and
  4. The future of web data.


The list of speakers includes Bright Data’s staff, representatives from Kaggle, Rakuten, Ixigo, and Claro Analytics, as well as a selection of tech influencers like Tech with Tim and TiffInTech. 

Registration is free of charge. You can select to participate in some or all of the sessions.

The conference was originally planned for November 7. However, it had to be postponed due to the situation in Israel. 

ScrapeCon joins Zyte’s Extract Summit and Oxylabs’ OxyCon as the third major event dedicated to web data collection.

Get proxy news and updates directly to your inbox.

The post Bright Data’s ScrapeCon to Take Place on April 2 appeared first on Proxyway.

]]>
https://proxyway.com/news/bright-data-scrapecon-2024/feed 0
Oxylabs Cuts Residential Proxy Rates by Up to 20% https://proxyway.com/news/oxylabs-cuts-residential-proxy-rates-20-percent https://proxyway.com/news/oxylabs-cuts-residential-proxy-rates-20-percent#respond Mon, 25 Mar 2024 13:06:24 +0000 https://proxyway.com/?post_type=news&p=20548 The reduction affects three entry plans.

The post Oxylabs Cuts Residential Proxy Rates by Up to 20% appeared first on Proxyway.

]]>

News

The reduction affects three entry plans. 

Adam Dubois

Oxylabs, the Lithuanian provider of data extraction tools and services, has made its residential proxies cheaper for small-scale use. 

The update affects Oxylabs’ three entry plans, cutting their rates by up to 20%. The final amount remains unchanged, meaning that customers will get more traffic for the same price.

 Pay as you goMicro ($99)Starter ($300)
TrafficUp to 50 GB/month13 GB40 GB
New price/GB$8$7.75$7.5
Old price/GB$10$9$8
Difference-20%-16%-6.25%

This move was likely made to counter Bright Data’s recent price decrease. In any case, both providers have become much more approachable, outbidding their premium peers and starting to put pressure on lower market segments.  

oxylabs residential pricing march 2024

This is the third price revision this year so far, following Smartproxy and Bright Data. It feels like there’s still more to come, so let’s wait and see how low the market can go. 

Get proxy news and updates directly to your inbox.

The post Oxylabs Cuts Residential Proxy Rates by Up to 20% appeared first on Proxyway.

]]>
https://proxyway.com/news/oxylabs-cuts-residential-proxy-rates-20-percent/feed 0
Bright Data Reduces Residential Proxy Prices by 20% https://proxyway.com/news/bright-data-reduces-residential-prices-20-percent https://proxyway.com/news/bright-data-reduces-residential-prices-20-percent#respond Fri, 08 Mar 2024 10:01:36 +0000 https://proxyway.com/?post_type=news&p=20039 The second round of price cuts continues.

The post Bright Data Reduces Residential Proxy Prices by 20% appeared first on Proxyway.

]]>

News

The second round of price cuts continues. 

Adam Dubois

Bright Data, the Israeli web data platform, has reduced the prices of its residential proxy network.

The provider’s rates have become 20% cheaper across the board. This includes both subscription plans and the pay-as-you-go option.

 Micro ($10)>$500$500+$1,000+$2,000+
New /GB$5.04$8.40$7.14$6.30$5.88
Old /GB$6.30$10.50$8.92$7.78$7.35
Change-20%

While still not exactly cheap, Bright Data has taken a strong position to compete with other premium providers like NetNut and Oxylabs. However, that’s without enabling premium features like city targeting, which double the price. 

bright data residential price comparison march 2024

This is Bright Data’s second price revision in a year, following the 30% reduction in September. It comes on the heels of Smartproxy’s pricing changes and will likely fuel a new round of discounts in the market. 2024 is shaping to be a great year for proxy shopping. 

Get proxy news and updates directly to your inbox.

The post Bright Data Reduces Residential Proxy Prices by 20% appeared first on Proxyway.

]]>
https://proxyway.com/news/bright-data-reduces-residential-prices-20-percent/feed 0
Zyte Adds AI Scraping Functionality to Its API https://proxyway.com/news/zyte-api-ai-scraping https://proxyway.com/news/zyte-api-ai-scraping#respond Wed, 06 Mar 2024 11:20:08 +0000 https://proxyway.com/?post_type=news&p=19989 The tool can now crawl, unblock, and parse websites using AI and an optional no-code interface.

The post Zyte Adds AI Scraping Functionality to Its API appeared first on Proxyway.

]]>

News

The tool can now crawl, unblock, and parse websites using AI and an optional no-code interface.

Adam Dubois
zyte ai scraping promo
Image source: Zyte

Zyte, the Ireland-based web data extraction platform, has announced the addition of new AI-based functionality to Zyte API. With it, the tool has become a “complete solution for web scraping”, allowing developers to “build new spiders and add data sources in minutes”. 

The solution combines Zyte API and pre-made spider templates accessible through Scrapy Cloud on Zyte’s dashboard. It allows extracting data from websites without creating your own crawling logic, dealing with anti-bot systems, or specifying selectors for data extraction.  

The data extraction step relies on Zyte’s proprietary machine learning model that the company claims to be up to “50x cheaper and 56% more accurate” than large language models like ChatGPT.

For now, the only available template covers e-commerce product pages. Using Scrapy Cloud’s no-code interface, you can quickly create a web scraper by specifying a few parameters: starting URL, geolocation, request count, crawling strategy, and browser rendering. The scraper then automatically crawls the website and returns structured data.

Developers who need more functionality can make quick customizations on the dashboard or use the provided Python library. This way, it’s also possible to build completely new scrapers that make use of Zyte’s AI tech.

The company first introduced its e-commerce template and no-code interface during Extract Summit, Zyte’s annual web scraping conference (you can find our recap here). We’ve tried it several times for small tasks like crawling e-commerce pages and received satisfactory results.

All in all, Zyte’s AI Scraping is a fascinating project that tries to combine the interests of both business folks and seasoned Scrapy developers. For now, the experience is still clunky: it requires two subscriptions (to Zyte API and Scrapy Cloud), parses limited data types, and the user interface can be intimidating. In addition, the API’s pricing isn’t always easy to estimate, as it changes dynamically based on the target. 

Having said that, the underlying tech is, without a doubt, solid, and we’re eager to see how the provider will streamline its product in the future. 

You can try the AI e-commerce template for free by claiming Zyte API’s $5 credit and using Scrapy Cloud’s free tier.

Get proxy news and updates directly to your inbox.

The post Zyte Adds AI Scraping Functionality to Its API appeared first on Proxyway.

]]>
https://proxyway.com/news/zyte-api-ai-scraping/feed 0
Smartproxy Slashes Residential Proxy Prices by Up to 25% https://proxyway.com/news/smartproxy-slashes-residential-proxy-prices-again https://proxyway.com/news/smartproxy-slashes-residential-proxy-prices-again#respond Tue, 05 Mar 2024 14:06:19 +0000 https://proxyway.com/?post_type=news&p=19950 It’s the second price cut in less than a year.

The post Smartproxy Slashes Residential Proxy Prices by Up to 25% appeared first on Proxyway.

]]>

News

It’s the second price cut in less than a year. 

Adam Dubois

Smartproxy, the international provider of proxies and web scraping tools, has reduced the rates of its residential proxy network. It’s also introduced two new enterprise plans.

The price reduction affects all packages and varies between 12% and 25%. What’s interesting is that the changes are more pronounced at 250 GB and up – usually, it’s the entry plans that see the biggest cuts. 

Furthermore, Smartproxy has added additional enterprise plans for two and five terabytes of data. This suggests that the provider is looking to attract more large-scale clients, in addition to making its offering more attractive overall. 

The updated pricing table looks as follows:

TrafficPriceReduction
1 GB (Pay as you go)$7-18%
2 GB$12 ($6/GB) -15%
8 GB$44 ($5.5/GB) -15%
25 GB$130 ($5.2/GB) -13%
50 GB$245 ($4.9/GB) -12%
100 GB$450 ($4.5/GB) -13%
250 GB$1,000 ($4/GB) -16.5%
500 GB$1,750 ($3.5/GB) -20%
1,000 GB$3,000 ($3/GB) -25%
2,000 GB (new)$5,600 ($2.8/GB) –
5,000 GB (new)$11,000 ($2.2/GB) –

Compared to other mid-range providers like IPRoyal or SOAX, Smartproxy’s rates are now among the lowest:

smartproxy residential prices march 2024

This is Smartproxy’s second residential proxy price decrease within a year, the first taking place last May. It prompted a competition with providers like Bright Data, Oxylabs, and SOAX, which reviewed their own rates throughout 2023. Apparently, there’s still more to come. 

Get proxy news and updates directly to your inbox.

The post Smartproxy Slashes Residential Proxy Prices by Up to 25% appeared first on Proxyway.

]]>
https://proxyway.com/news/smartproxy-slashes-residential-proxy-prices-again/feed 0
Meta Drops the Case Against Bright Data https://proxyway.com/news/bright-data-wins-against-meta https://proxyway.com/news/bright-data-wins-against-meta#respond Tue, 27 Feb 2024 11:47:17 +0000 https://proxyway.com/?post_type=news&p=19767 The dispute results in a complete win for the data collection infrastructure provider.

The post Meta Drops the Case Against Bright Data appeared first on Proxyway.

]]>

News

The dispute results in a complete win for the data collection infrastructure provider.

Adam Dubois

Bright Data, the Israeli provider of web data extraction tools and services, has officially won the case against Meta. 

On February 23, Meta filed a motion to dismiss its claim against Bright Data. The motion follows the U.S. district court’s decision a month before, where Judge Edward Chen denied that Bright Data had breached Meta’s contract terms by scraping data off Facebook and Instagram.

The remaining argument involved tortious interference with Meta’s contractual terms and focused on customer use of Bright Data’s services. 

In a LinkedIn post, Bright Data’s CEO Or Lenchner emphasized that the outcome wasn’t the result of a settlement, but rather that Meta dropped the case conceding its loss. The motion also means that Facebook’s owner won’t be able to file for an appeal. In other words, it’s a complete win for Bright Data.

The broader web scraping industry received an important clarification that scraping Facebook and Instagram is contractually illegal only if done through a logged in account used for this purpose. This rules out far-fetched and borderline ridiculous arguments that merely having a company account creates an obligation not to scrape the platforms whatsoever. 

Having said that, the court’s line of reasoning focused on Meta’s terms in particular. At this point, it would still be dangerous to extend it to all cases of similar nature. One of them is currently taking place with the same Bright Data and Elon Musk’s X. Corp. 

In any case, this is a big win for Bright Data and the whole web data collection community. Well done.

Get proxy news and updates directly to your inbox.

The post Meta Drops the Case Against Bright Data appeared first on Proxyway.

]]>
https://proxyway.com/news/bright-data-wins-against-meta/feed 0
NetNut Launches Website Unblocker, SERP API, Professional Datasets https://proxyway.com/news/netnut-unblocker-serp-api-datasets https://proxyway.com/news/netnut-unblocker-serp-api-datasets#respond Mon, 19 Feb 2024 12:27:58 +0000 https://proxyway.com/?post_type=news&p=19498 The Israeli proxy provider moves up the data extraction value chain.

The post NetNut Launches Website Unblocker, SERP API, Professional Datasets appeared first on Proxyway.

]]>

News

The Israeli proxy provider moves up the data extraction value chain. 

Adam Dubois

NetNut, the Israeli provider of proxy servers, has officially expanded its product line-up with two web scraping tools and two datasets. 

NetNut first previewed the new products back in October 2023. Four months later, they’re officially available for purchase.

Website Unblocker

netnut website unblocker marketing page

NetNut’s first new product is a general-purpose web scraper, likely a proxy API. Like other products in this category, it combines proxy servers with anti-detection technology to simplify web scraping operations. The tool handles fingerprint management, automatic proxy rotation, and CAPTCHA solving.

Thus far, there’s little concrete information about Website Unblocker’s implementation and pricing. That said, NetNut offers the option to try it out for free.

SERP Scraper API

Netnut Google SERP Scraper API main information

NetNut’s second tool is a Google scraper. It returns search engine result pages in raw (HTML) or parsed (JSON) formats. The API supports city-level localization, pagination, device selection, and several other parameters. Some more advanced features, like filtering by search type, seem to be missing at this time. 

With pricing that starts from $1,200, SERP Scraper API is meant for enterprise use:

 ProductionSemi-ProProfessionalMaster
Requests1 million10 million25 million100 million
Price$1,200$7,200$25,000$40,000
Price/1,000 requests$1.2$0.72$0.5$0.4

Professional Datasets

Netnut professional datasets information

NetNut’s final product includes datasets from what we assume is the largest professional network (LinkedIn). You can buy information about 650 million user profiles or an unspecified number of companies.

The marketing page mentions both real-time data and scheduled delivery, which makes it unclear what exactly you’re getting. It seems like the product takes the form of an API. It delivers data in structured formats like JSON and CSV, which can be stored in popular cloud services. 

At the moment, there’s no information about the pricing or what the output looks like. To learn more about it, you’ll have to contact NetNut’s sales team.

Bottom Line

NetNut starts the year strong and further increases the tightening competition in the web scraper space. With full focus from its parent company Alarum and having achieved record revenue numbers in Q4, the provider is determined to position itself as one of the market leaders in 2024.

Get proxy news and updates directly to your inbox.

The post NetNut Launches Website Unblocker, SERP API, Professional Datasets appeared first on Proxyway.

]]>
https://proxyway.com/news/netnut-unblocker-serp-api-datasets/feed 0