Tech Lead - Web Scraping and Crawling

 (5+ years exp)
Published: 1 month ago
Avatar for AdvaRisk

AdvaRisk

A B2B fraud detection and decision support platform for financial institutions

Job Location

Job Type

Full Time

Visa Sponsorship

Not Available

Relocation

Allowed

Skills

Python
Java
HTML
CSS
Django
Linux
Scrapy
Beautiful Soup
AWS

The Role

KEY RESPONSIBILITIES:
Manage individual projects priorities, deadlines, and deliverables
Gather and process raw data at scale (including writing scripts, web scraping, calling/create APIs, etc.) from the web / internet
Develop frameworks for automating and maintaining constant flow of data from multiple sources
Identify, analysis, design, and implement internal process improvements
Design and implement tooling upgrades to increase stability and data quality
Help team to fix issues that occur in test and production environments
Automate software development processes, including build, deploy, and test
Mange and guide the team members

REQUIRED QUALIFICATIONS:
4+ years of web crawling/ scraping experience is a must
Strong knowledge of scraping frameworks such as Scrapy, Beautiful Soup, HTQL, Jsoup, Web-Harvest and others
Excellent verbal, written, and interpersonal communication skills in English
Good to have Experience of complex crawling (like captcha, Mobile OTP based crawling, bypassing proxy)
Sound Knowledge in Bot Management Techniques
Experience in various data extraction methods (like data extraction from PDF Files, web pages, etc)
Good understanding of HTML DOM, CSS, Javascript, and RESTful web services
Good to have understanding of AWS
Experience with Linux
Experience with Java / Python

More about AdvaRisk

Funding

AMOUNT RAISED
$700K
FUNDED OVER
1 round
Round
S
$700,000
Seed Sep 2019
image

Founders

Rahul Metkar
Founder • 3 years
image
Vishal Sharma
Founder • 3 years
Mumbai
image
Go to team image

Similar Jobs

Revofit company logo
Revofit
Your pocket guide to holistic health
Thrive company logo
Thrive
Online ordering platform empowering restaurants to reduce their dependence on aggregators
DronaHQ company logo
DronaHQ
Build enterprise grade mobile apps in no time
LogiNext company logo
LogiNext
SaaS for Delivery and Transportation Business
Growfitter company logo
Growfitter
Incentivised Wellness Platform that rewards you to Grow Healthy Grow Happy & Grow Fitter
SwitchMe Technologies company logo
SwitchMe Technologies
Making mortgage refinance easy in India
Beyond Enough company logo
Beyond Enough
An Interactive Lifestyle Experience Platform (a Luxury Ecosystem). Live Life Out Of Sync
Gupshup Technology company logo
Gupshup Technology
Enable engaging conversations seamlessly across 30+ channels using a Single API