Avatar for VCBay
We are an all-in-one networking platform for Startups and Investors
  • Early Stage
    Startup in initial stages

Data Science Intern (web scrapper)

  • ₹60,000 – ₹80,000 • No equity
  • Remote • 
  • 1 year of exp
  • Internship
Reposted: 4 months ago• Recruiter recently active
Visa Sponsorship

Not Available

Remote Work Policy

Remote only

Hires remotely in
RelocationNot Allowed
Skills
Python
HTML
SQL
MySQL
MongoDB
REST APIs
Data Management
MLOps
Large Language Models (LLMs)

About the job

As a Web Scraping focused Data Engineer, you will be responsible for extracting and ingesting data from websites/other sources using web crawling tools. In this role you will own the creation process of these tools, services, and workflows to improve crawl/ scrape analysis, reports and data management. We will rely on you to test the data and the scrape to insure accuracy and quality. You will own the process to identify and rectify any issues with breaks as well as scale scrapes as needed.

What's required

  • Experience running large scale web scrapes
  • Solid Python knowledge
  • Familiarity with Mongodb, feature engineering
  • Familiarity with techniques and tools for crawling, extracting and processing data (e.g. Scrapy, pandas, mapreduce, SQL, BeautifulSoup, etc).
  • Experience with system monitoring/administration tools.
  • Experience with applications of LLMs.
  • Experience with version control, open source practices, and code review
  • Experience with applications designed to display archived web content
  • Great communication skills (written and Spoken in English)

About the company

VCBay company logo
We are an all-in-one networking platform for Startups and Investors11-50 Employees
Company Size
11-50
Company Type
SaaS
Company Industries
Startups
Company Industries
VC Firm
  • Early Stage
    Startup in initial stages
Learn more about VCBay image

Similar Jobs

ClearFeed company logo
ClearFeed
Conversational Support Platform for Slack and Teams
| Networth Corp | company logo
| Networth Corp |
Fast-tracking of global problem solving and value generation from innovation
Bobble AI Technologies company logo
Bobble AI Technologies
World's first Conversation Media Platform, enriching everyday conversations!
Fintricity company logo
Fintricity
We're a venture studio, providing consulting and building ventures, and helping scaleups