Web Scraping Specialist
$75K$150K
Worldwide
auto-extracted
More jobs from this company
views: 0
Web Scraping Specialist

Overview

Description

Location: Remote with a 6 hour overlap with EST

Remote | Full-time

Compensation: $75K – $150K

We are hiring on behalf of our client who is seeking a Web Scraping Specialist to join a specialized technical team focused on building the infrastructure that delivers massive amounts of web data for the training of advanced AI models. This organization operates a massive distributed crawler and manages complex pipelines for ingesting, segmenting, and annotating billions of data points, including videos, transcripts, and audio files.

The successful candidate will lead efforts to gather and analyze data, optimize scraping processes, and support the scaling of high-quality public web data accessibility. This role is ideal for a lean, technical builder who thrives in a fast-paced environment without bureaucratic red tape.

Key Responsibilities:

  • Code Development: Write, test, and refine high-performance code to extract data from various online sources, ensuring maximum reliability and efficiency.
  • Data Retrieval: Manage complex data retrieval tasks, including handling pagination and dynamic content loaded via AJAX.
  • Data Quality: Clean and format extracted data to ensure it meets rigorous quality standards for downstream analysis and processing.
  • Database Management: Store and manage scraped data in appropriate databases, optimizing for both access speed and long-term data integrity.
  • Monitoring and Maintenance: Regularly monitor scraping processes and infrastructure to identify and resolve issues, ensuring a continuous and stable data flow.

Tagged as: AI, JavaScript, machine learning, mid-level, MongoDB, noSQL, Python, senior

Web Scraping Specialist
$75K$150K
Worldwide
auto-extracted
More jobs from this company
views: 0

Be the first to know about
new jobs every week

Get 8 new jobs with salaries, once per week! Sign up here so you don't miss a single newsletter.