Proxy locations

Europe

North America

South America

Asia

Africa

Oceania

See all locations

Network status Careers

hello@oxylabs.io

English (EN)

English

中文

Proxies

Proxies & Advanced Proxy Solutions

Residential Proxies

Human-like scraping without IP blocking

Mobile Proxies

Harness the power of IP addresses from real mobile devices

Rotating ISP Proxies

Extract the required data without the fear of getting blocked

Web Unblocker

AI-powered proxy solution for block-free scraping

Shared Datacenter Proxies

Fast and reliable proxies for cost-effective scraping

Dedicated Datacenter Proxies

The highest performing proxies on the market

Static Residential Proxies

Combined power of Datacenter and Residential IPs

Tools & Addons

Oxy Proxy Extension for Chrome

Free Chrome proxy manager extension that works with any proxy provider.

Oxy Proxy Manager for Android

Free Android proxy manager app that works with any proxy provider.

Proxy RotatorAdd-on

Rotates your Datacenter Proxies to help increase success rates.

Scraper APIs

SERP Scraper APIFREE TRIAL

Scalable SERP data delivery from major search engines

E-Commerce Scraper APIFREE TRIAL

Enterprise-level data from largest e-commerce marketplaces

Real Estate Scraper APIFREE TRIAL

Real-time data from popular real estate websites

Web Scraper APIFREE TRIAL

Public data delivery from a majority of websites

Features

Web Crawler

Discovers all pages on a website and fetches data at scale.

Scheduler

Schedules multiple scraping and parsing jobs at specified frequencies.

Custom Parser

Parses scraped documents by executing given parsing instructions.

Headless BrowserNEW

Render JavaScript and execute browser instructions.

DatasetsNew

Datasets

Company Data

Comprehensive datasets for business profiling

E-Commerce Product Data

Datasets for product catalog insights from E-Commerce stores

Job Postings Data

Datasets for labour market research and insights

Community and Code Data

Datasets for developer community trends

Product Review Data

Fresh datasets for user sentiment analysis

Pricing

Proxies

Residential Proxies

Human-like scraping

Starts from

$10

Pay as you go

Mobile Proxies

3G/4G/5G Mobile Proxies

Starts from

$22

Pay as you go

Rotating ISP Proxies

Extended sessions

Starts from

$340/month

Shared Datacenter Proxies

Cost-effective solution

Starts from

$50/month

Dedicated Datacenter Proxies

Superior performance

Starts from

$50/month

Scraper APIs

SERP Scraper API

Scalable SERP data delivery

Starts from

$49/month

E-Commerce Scraper API

Enterprise-level product page data

Starts from

$49/month

Web Scraper API

Data from a majority of websites

Starts from

$49/month

Real Estate Scraper API

Real-time real estate data

Starts from

$49/month

Advanced Proxy Solutions

Web Unblocker

AI-powered proxy solution

Starts from

$75/month

Learn

Getting Started

Knowledge Base

Read the latest articles about the world of web scraping, proxies, and more

Webinars

Check our webinars to learn more about data gathering issues and solutions

White papers

Get extensive white papers to understand the most complex scraping topics

OxyCon

Join inspiring discussions at Oxylabs’ annual web scraping conference

Scraping Experts

Watch lessons by industry-leading experts to gain insights on data gathering

Useful Information

Quick Start Guides

Featured

Explore tutorials and code samples to build a web scraping infrastructure with Oxylabs solutions.

Solutions

By Industry

E-Commerce

Get access to valuable e-commerce data with the help of advanced scraping solutions

Cybersecurity

Collect threat intelligence and inspect risky activities anonymously with reliable proxies

Brand protection

Monitor the web on a large scale to ensure no unauthorized product seeped into the market

SERP Monitoring

Monitor SERPs to enhance your business strategy

Travel and hospitality

Gather real-time flight and hotel data to and build a solid strategy for your travel business.

By Use Case

View all

By Target

View all

Back to blog

OxyCon Events Sustainability

OxyCon 2022: The Top Takeaways From Day Two

Yelyzaveta Nechytailo

2022-09-094 min read

OxyCon 2022 has officially come to an end, which means it’s time to take a look at some of the biggest highlights from Day Two!

Just like the conference’s Day One, yesterday’s live sessions were able to immerse us in the most trending topics of the web scraping industry as well as offer new perspectives on the things we thought we already knew a lot about.

So, make sure you’ve marked all the event’s key points by checking out the blog post below.

A Crash Course in Machine Learning for Text Using Web Data

The day started with a presentation on Machine Learning (ML) – an increasingly popular discipline of AI essential for evaluating data and making predictions with almost zero human intervention. Allen O’Neill, CEO/CTO at The DataWorks, explored how to use ML to turn text-based web data into valuable information-rich insights using open-source tools and technologies.

One of the most valuable ideas from Allen’s presentation – we need to harness the power of information extraction, not data extraction. Information has structure, value, and, with the usage of Natural Language Processing (NLP), it can be broken down into small parts for matching and meaning extraction.

Named entity recognition, words as numbers, part of speech, proximity search – all these NLP techniques should be utilized in synergy to handle ambiguity and get:

Essential market insight
Product discovery
Issue identification
New product ideas

How Data Scraping and Creative Algorithms Can Lead to Exciting Products

Then, we continued the day with another external speaker Karsten Madsen, CEO at Morningscore. In this talk, Karsten decided to focus on their own example of building a company in the ever changing and demanding web scraping land.

When trying to succeed in the market, Morningscore had to face multiple challenges, such as:

A huge number of competitors
Ahrefs claim to have 30% of Google's server capacity (which basically meant they are competing with an industry giant)
A need to organize billions of data at high speed and with amazing accuracy

Despite all that, Karsten and his team were able to find their own way of standing a chance. They partnered with the best data suppliers to get the needed public data quickly and with less costs. And instead of being an underdog, turned to creativity and gamification for smarter data presentation and enhanced user experience.

Overall, this was an ultimate story of how weaknesses can be turned into strengths.

Observability and Web Scrapers: Filling the Unknown Void

Knowing your scraper is the beginning of all wisdom – this is the phrase Martynas Saulius, Python Developer at Oxylabs, used to start his presentation. And due to its noticeable interconnection with the famous saying by Aristotle, this sentence immediately grabbed the attention of every viewer.

In the presentation itself, Martynas presented the effective observability trifecta, Logs, Metrics, & Tracing, and even gave a detailed explanation why Metrics is his personal favorite pillar of observability. He discussed its types, gathering methods, and highlighted one of the important Metrics’ benefits – the ability to make your system autonomous.

The speech concluded with another memorable phrase – knowledge is power. By using all the observability pillars you can gather essential wisdom to improve your tooling and become more powerful.

Practical Application of Common Web Scraping Techniques

The knowledge-sharing day continued with a presentation from Eivydas Vilčinskas, Technical Team Lead at Oxylabs and a regular expert at OxyCon web scraping conference since 2019.

This time, Eivydas decided to demonstrate a practical introduction to the wide field of scraping and share some important tips he came up with during the years of technical experience, starting from session preparation to data parsing.

To make sure you can use all the mentioned tips for your own web scraping projects, we’ve shortly noted all of them below.

Tip 1. Browserless is faster than headless

Tip 2. Change the “User-Agent” header

Tip 3. Prepare your session

Tip 4. Re-use session parameters

Tip 5. Use proxies

Tip 6. The Developer Tools, your best friend

Tip 7. API over HTML

Tip 8. Use lxml with XPath

Tip 9. Queues make your system robust

For your convenience, you can also check out the code samples shared by Eivydas during the presentation by following this link.

Data Collection: Orchestration, Observability, and Introspection

Another presentation covering the general information about web scraping and its related processes was delivered by Paul Morgan, Data Collections Team Lead at Datasembly. By breaking down his talk into separate sections, such as WTF of data collection, orchestration, observability, and introspection, he was able to keep the viewers interested throughout the whole session as well as deliver enough details on every topic.

Particular attention was, of course, given to job observability as an essential concept to get insights into the whole infrastructure. Paul Morgan shared the different ways they perform job observability in their company, which obviously set a great example for the viewers looking to improve this aspect of their business.

So, according to Paul, observability can be achieved by:

Keeping track of different steps throughout the process
Noticing the various issues that occur
Performing automatic monitoring that allows to know if something wrong is happening sooner than later.

Web Scraping at Scale With Quality and Compliance

A chance to close this year’s web scraping conference was given to Sarah McKenna, CEO at Sequentum and an experienced engineering manager running all kinds of automated operations.

As highlighted by the speaker, there are a lot of problems for large scale web data extraction projects (sites change, errors happen, unexpected edge cases arise); however, it doesn’t mean that these issues will make the process fail. That’s why in her presentation, Sarah chose to focus on covering every little aspect that can make public data extraction at scale not only possible but also successful.

The expert shared all the tiny details you have to keep your eye on, explained how to define, measure and track KPIs for public web data extraction operation, and even explained how to handle key compliance concerns and mitigate legal risks.

As you see, OxyCon Day Two was all about sharing valuable knowledge. It was intense, entertaining, technical, and hopefully encouraged you to to look at the discussed topics from different angles.

See you at OxyCon 2023!

About the author

Yelyzaveta Nechytailo

Senior Content Manager

Yelyzaveta Nechytailo is a Senior Content Manager at Oxylabs. After working as a writer in fashion, e-commerce, and media, she decided to switch her career path and immerse in the fascinating world of tech. And believe it or not, she absolutely loves it! On weekends, you’ll probably find Yelyzaveta enjoying a cup of matcha at a cozy coffee shop, scrolling through social media, or binge-watching investigative TV series.

Learn more about Yelyzaveta Nechytailo

All information on Oxylabs Blog is provided on an "as is" basis and for informational purposes only. We make no representation and disclaim all liability with respect to your use of any information contained on Oxylabs Blog or any third-party websites that may be linked therein. Before engaging in scraping activities of any kind you should consult your legal advisors and carefully read the particular website's terms of service or receive a scraping license.

OxyCon Events