Data Wrangler Extraordinaire
(3+ years exp)Weave.bio
Job Type
Full TimeVisa Sponsorship
AvailableRemote Work Policy
Remote onlyHires remotely
Preferred Timezones
Relocation
AllowedSkills
The Role
Do you enjoy taming data and making it work for you? Are you a Python pro who loves to work with noSQL and graph databases while orchestrating workflows with tools like Apache Airflow? Then, you might be our next Data Engineer!
We're looking for a data wrangler who can take on anything from simple JSON files to complex XML formats and turn them into actionable insights. You'll work with Spark to distribute and process data, building pipelines that are as efficient as they are elegant. You'll ensure that our data lineage is transparent and well-documented with OpenLineage. In short, you'll be a data superhero - swooping in to save the day with your analytical prowess and technical wizardry.
Key Responsibilities:
- As first data engineer, devise data architecture and ultimately lead data engineering team
- Lassoing and taming unruly data with Python and Pandas
- Orchestrating workflows with Apache Airflow (because wrangling data is a lot like herding cats)
- Building robust data storage solutions with noSQL databases like Redis and MongoDB and graph databases like Neo4j
- Shaping and molding data from various file formats including JSON and XML into usable insights
- Using Spark to distribute and process data at scale
- Ensuring data lineage is transparent and traceable with OpenLineage
Key Requirements:
- 10+ years of experience in data engineering
- Experience working in agile teams required; experience at a startup strongly preferred
- A love for all things data and a knack for turning it into insights
- Python is your bread and butter, and you know your way around Pandas like the back of your hand
- You're an Apache Airflow expert, because you know that wrangling data is a lot like herding cats
- You're familiar with noSQL databases like Redis and MongoDB and graph databases like Neo4j
- You're comfortable working with various file formats including JSON and XML
- You're a Spark wizard, because scaling data processing is a breeze for you
- You have experience with data lineage frameworks like OpenLineage
- You're a problem-solver with excellent communication skills (because saving the day often requires explaining what you did to the rest of the team).
If you're ready to don your cape and take on the world of data, we want to hear from you! Apply today and show us your data wrangling superpowers! (NB: Complete this coding challenge -- https://github.com/weavebio/data-engineering-coding-challenge -- and add the link to in this application.)