Wynd Labs
Actively Hiring
Making AI Data Accessible. Building a suite of products powered by Grass

Web Scraping Specialist

Reposted: 1 month ago
Visa Sponsorship: Not Available

Remote Work Policy: Remote only

Hires remotely in
Preferred Timezones: Eastern Time
Relocation: Not Allowed
Skills
Python
JavaScript
Machine Learning
Data Analysis
NoSQL
MongoDB
Algorithms
Cassandra
Web Scraping
Selenium
DOM
HTML/CSS/JavaScript
Azure
AWS Cloud Services
Data Management
Python Web Scraping (Beautiful Soup/Scrapy)

About the job

Web Scraping Specialist

$70k – $140k

Who We Are.

Wynd Labs is an early-stage startup on a mission to make public web data accessible for AI through contributions to Grass.

Grass is a network-sharing application that lets users share their unused bandwidth. In effect, it is a residential proxy network that directly rewards individual residential IPs for the bandwidth they provide. Grass will route traffic equitably across its network and meter the amount of data each node provides so that rewards are distributed fairly.

In non-technical terms: Grass unlocks everyone's ability to earn rewards by simply sharing their unused internet bandwidth on personal devices (laptops, smartphones).

This project is for those who lead with initiative, seek to challenge themselves, and thrive on curiosity.

We operate as a lean, highly motivated team that revels in the responsibility that comes with autonomy. Our organizational structure is flat: the people making decisions are also the ones implementing them. We are driven by ambitious goals and a strong sense of urgency. Leadership is given to those who show initiative, consistently deliver excellence, and bring out the best in those around them. Join us if you want to set the tone for a fair and equitable internet.

The Role.

We are seeking a Web Scraping Specialist with significant experience in data extraction and web scraping techniques. You will join a small, specialized team and lead efforts to gather and analyze data, optimize scraping processes, and support our vision for a future where Grass plays a crucial role in transforming internet data accessibility.

Who You Are.

  • Demonstrated ability to extract data from complex websites with minimal supervision, with a portfolio or examples of past projects.
  • Proficiency in languages such as Python or JavaScript, with strong skills in libraries and frameworks like BeautifulSoup, Scrapy, or Selenium.
  • Knowledge of asynchronous programming, multithreading, and distributed scraping.
  • In-depth knowledge of HTML, CSS, JavaScript, and the Document Object Model (DOM).
  • Experience with NoSQL databases (MongoDB, Cassandra), including the ability to design efficient storage solutions and manage data integrity.
  • Ability to apply machine learning algorithms for data cleaning, categorization, or predictive analysis is a significant plus.
  • Experience with cloud services (AWS, Google Cloud, Azure) for deploying and managing scraping jobs at scale.
  • Active participation in open-source projects related to web scraping, data processing, or similar fields.

What You'll Be Doing.

  • Write, test, and refine code that extracts data from various online sources, ensuring reliability and efficiency.
  • Perform data retrieval tasks, handling complexities such as pagination and dynamic content loaded with AJAX.
  • Clean and format extracted data, ensuring it meets quality standards for further analysis or processing.
  • Store and manage the scraped data in appropriate databases, optimizing for access speed and data integrity.
  • Monitor the scraping processes regularly, identifying and resolving issues to maintain continuous data flow.

Why Work With Us.

  • Opportunity. We are at the forefront of developing a web-scale crawler and knowledge graph that allows ordinary people to participate in the process and share in the benefits of AI development.
  • Culture. We’re a lean team working together to achieve a very ambitious goal of improving access to public web data and distributing the value of AI to the people. We prioritize low ego and high output.
  • Compensation. You’ll receive a competitive salary and equity package.
  • Resources and growth. We’re well-capitalized, with backing from leading venture funds like Polychain, Tribe, NLH, Hack, BH Digital, and more. We keep a lean team, and this is a rare opportunity to join. You’ll learn a lot and grow as our company scales.

About the company

Wynd Labs

Actively Hiring
Making AI Data Accessible. Building a suite of products powered by Grass
11-50 Employees

Funding

Amount raised: $4.5M
Funded over: 2 rounds
Rounds: $3,500,000 (Seed, Jan 2024), plus 1 other round

Founders

Christopher Nguyen
Founder • 3 years
New York City
