- Early StageStartup in initial stages
Data Science Intern (web scrapper)
- ₹60,000 – ₹80,000 • No equity
- Remote •
- 1 year of exp
- Internship
Reposted: 4 months ago• Recruiter recently active
Visa Sponsorship
Not Available
Remote Work Policy
Remote only
Hires remotely in
RelocationNot Allowed
Skills
Python
HTML
SQL
MySQL
MongoDB
REST APIs
Data Management
MLOps
Large Language Models (LLMs)
About the job
As a Web Scraping focused Data Engineer, you will be responsible for extracting and ingesting data from websites/other sources using web crawling tools. In this role you will own the creation process of these tools, services, and workflows to improve crawl/ scrape analysis, reports and data management. We will rely on you to test the data and the scrape to insure accuracy and quality. You will own the process to identify and rectify any issues with breaks as well as scale scrapes as needed.
What's required
- Experience running large scale web scrapes
- Solid Python knowledge
- Familiarity with Mongodb, feature engineering
- Familiarity with techniques and tools for crawling, extracting and processing data (e.g. Scrapy, pandas, mapreduce, SQL, BeautifulSoup, etc).
- Experience with system monitoring/administration tools.
- Experience with applications of LLMs.
- Experience with version control, open source practices, and code review
- Experience with applications designed to display archived web content
- Great communication skills (written and Spoken in English)
About the company
Similar Jobs
Cypherock Wallet
Personal Fort Knox for Your Crypto
Xplorazzi
Using AI of Retail Shelf Images for merchandising audit
CloudAEye
Intelligent Cloud Operations
ClearFeed
Conversational Support Platform for Slack and Teams
Rockmetric
Intelligent AI-powered Business Analytics Platform
| Networth Corp |
Fast-tracking of global problem solving and value generation from innovation
Bobble AI Technologies
World's first Conversation Media Platform, enriching everyday conversations!
Fintricity
We're a venture studio, providing consulting and building ventures, and helping scaleups
Houseware
AI-first Digital Experience Platform