Web Scraper (1+ years exp)
₹3L – ₹8L
Published: 1 month ago

Karza Technologies

Building products to detect fraud and risk instances using big data and machine learning

Job Location

Job Type

Full Time

Visa Sponsorship

Not Available

Relocation

Allowed

Skills

Python Web Scraping (Beautiful Soup/Scrapy)

The Role

At Karza Technologies, our DI (Data Ingestion) team uses frameworks such as Apache Airflow to automate tasks end to end, from source to database. We collect data and legal documents from publicly available government data sources. Our web scrapers run on kscrapy, a framework developed in-house by Karza developers that is fully automated from scraping through parsing. Our engineers use cloud microservices such as Lambda and Docker to deploy APIs on the collected data after applying business intelligence and analytics.

A few recognitions:
• Recognized by LinkedIn as one of the Top 25 startups to work with in India, 2019
• Winner of HDFC Bank's Digital Innovation Summit 2020
• Super Winners (won every category) at Technoviti 2020 by Banking Frontiers
• Winner of Amazon AI Award 2019 for Fintech
• Winner of FinTech Spot Pitches at Fintegrate Zone 2018 held at BSE
• Winner of FinShare 2018 challenge held by ShareKhan
• Only startup in Yes Bank Global Fintech Accelerator to win the account during the Cohort
• 2nd place in the Citi India FinTech Challenge 2018 by Citibank
• Top 3 in Viacom18's Startup Engagement Programme VStEP

What your average day would look like:
• As a Python Developer, you will apply your skills to fetch data from multiple online sources, cleanse it, and build APIs on top of it.
• Develop a deep understanding of our vast data sources on the web and know exactly how, when, and which data to scrape, parse, and store.
• Work closely with Database Administrators to store data in SQL and NoSQL databases.
• Develop frameworks for automating and maintaining a constant flow of data from multiple sources.
• Work independently with little supervision to research and test innovative solutions.

Skills required:
• Strong coding experience in Python (knowledge of Java or JavaScript is a plus)
• Experience with SQL and NoSQL databases
• Experience with multi-processing, multi-threading, and AWS/Azure
• Strong knowledge of scraping tools and frameworks such as Python (Requests, Beautiful Soup), Web-Harvest, and others
• In-depth knowledge of algorithms and data structures; previous experience with web crawling is a must

Experience Required:
• 1 - 3 years of relevant experience
