Data Wrangler Extraordinaire

 (3+ years exp)
Published: 1 month ago
Avatar for Weave.bio

Weave.bio

The AI operating system for biotech companies

Job Location

Remote • 
Australia • 
Europe • 
Africa • 
Asia • 

Job Type

Full Time

Visa Sponsorship

Available

Remote Work Policy

Remote only

Hires remotely

Everywhere

Preferred Timezones

Pacific Time, Mountain Time, Central Time, Eastern Time

Relocation

Allowed

Skills

Python
MongoDB
XML
Redis
JSON
Neo4J
Pandas
Spark
Apache Spark
Apache Airflow

The Role

Do you enjoy taming data and making it work for you? Are you a Python pro who loves to work with noSQL and graph databases while orchestrating workflows with tools like Apache Airflow? Then, you might be our next Data Engineer!

We're looking for a data wrangler who can take on anything from simple JSON files to complex XML formats and turn them into actionable insights. You'll work with Spark to distribute and process data, building pipelines that are as efficient as they are elegant. You'll ensure that our data lineage is transparent and well-documented with OpenLineage. In short, you'll be a data superhero - swooping in to save the day with your analytical prowess and technical wizardry.

Key Responsibilities:

  • As first data engineer, devise data architecture and ultimately lead data engineering team
  • Lassoing and taming unruly data with Python and Pandas
  • Orchestrating workflows with Apache Airflow (because wrangling data is a lot like herding cats)
  • Building robust data storage solutions with noSQL databases like Redis and MongoDB and graph databases like Neo4j
  • Shaping and molding data from various file formats including JSON and XML into usable insights
  • Using Spark to distribute and process data at scale
  • Ensuring data lineage is transparent and traceable with OpenLineage

Key Requirements:

  • 10+ years of experience in data engineering
  • Experience working in agile teams required; experience at a startup strongly preferred
  • A love for all things data and a knack for turning it into insights
  • Python is your bread and butter, and you know your way around Pandas like the back of your hand
  • You're an Apache Airflow expert, because you know that wrangling data is a lot like herding cats
  • You're familiar with noSQL databases like Redis and MongoDB and graph databases like Neo4j
  • You're comfortable working with various file formats including JSON and XML
  • You're a Spark wizard, because scaling data processing is a breeze for you
  • You have experience with data lineage frameworks like OpenLineage
  • You're a problem-solver with excellent communication skills (because saving the day often requires explaining what you did to the rest of the team).

If you're ready to don your cape and take on the world of data, we want to hear from you! Apply today and show us your data wrangling superpowers! (NB: Complete this coding challenge -- https://github.com/weavebio/data-engineering-coding-challenge -- and add the link to in this application.)

More about Weave.bio

Founders

Shlomo Klapper
Founder • 3 years
image
Go to team image

Similar Jobs

NetVirta company logo
NetVirta
The most accurate, smartphone based 3D body scanning app and end-to-end platform
KGtoPG  "Dot eVentures Pvt Ltd" company logo
KGtoPG "Dot eVentures Pvt Ltd"
Our vision is to enable the improvement of educational outcomes around the world
Bread company logo
Bread
Transforming retail, unlocking growth
Moka company logo
Moka
Operating Platform for Businesses
SquareFoot company logo
SquareFoot
Find office space. View comprehensive listings tailored to your company brand and culture
deepPIXEL company logo
deepPIXEL
deepPiXEL is an AI platform that uses AI to help companies and humans
Asgard.ai company logo
Asgard.ai
Get qualified leads matching your ideal customer profile
MediaMelon company logo
MediaMelon
B2B SaaS Data Platform for Video Streaming (OTT) Analytics