Avatar for Flow Labs

Cleaner, clearer, safer roads for everyone

Data Engineer

$90k – $125k • 0.1% – 0.4%
Apply now

(Oakland, CA)

Our Mission:
To eliminate congestion, improve traffic flow and give back time to everyone.

Role Mission:
Your main mission is to build the data infrastructure that enables us to collect, store and efficiently process traffic and mapping data for every road and every road user in the US. The data which you provide enables our Data Science team to accurately model traffic patterns allowing us to optimize traffic signals. Your work will impact millions of people each day: with better traffic signal timing we will lower travel times by up to 25%, reduce emissions by up to 22% and enable everyone to spend more time on the things that are most important to them and with the people who are most important to them.

Key Responsibilities:
You will be responsible for creating the production software to run our data pipelines to feed the data into our Modelling, Simulation and Optimization platform. The pipeline will do aggregations, data cleaning, and transformations and must be self-healing and scalable data pipeline.
You will be managing multiple data types including time series data, spatial data, relational data, and blob store data.
You will create automation tools, as well as create processes for data monitoring and anomaly detection.
You will ensure that the pipeline are testable, reliable and follow good data engineering practices.
The Data Science and Machine Learning teams will be your internal customers. You will work closely with them to understand their expectations and meet or exceed them.
You will evaluate data providers and partners to help our team select the best data sources for the needs of the team.

Required Skills:
BS / MS in Mathematics, Computer Science or an Engineering discipline from a top university.
3+ years of experience in production software engineering using languages such as Python, Java, C++, etc
3+ years of experience in SQL/relational databases and 1+ year with wide column store (eg DynamoDB, PostgreSQL)
Solid understanding of relational concepts and pros and cons of using each type of data store
Experience building high-performance batch and real-time data processing pipelines (MapReduce, Hadoop, HBase/Cassandra, Spark, Samza etc)
Experience with AWS

Experience building and managing large-scale geospatial data systems (mapping, navigation, GIS, routing, fleet management).

Flow Labs at a glance

Cleaner, clearer, safer roads for everyone

Flow Labs focuses on Artificial Intelligence, Transportation, and Deep Learning. Their company has offices in San Francisco and Oakland. They have a small team that's between 1-10 employees.

You can view their website at http://www.flowlabs.ai or find them on Twitter and LinkedIn.

More jobs at Flow Labs

View all jobs

Software Engineer - Full-Stack

Similar jobs to Data Engineer at Flow Labs

Avatar for Fabric Genomics
Global healthcare platform for genomics-driven precision medicine, proven AI algorithms
Avatar for PolySign
Supporting the full spectrum of digital assets to scale to trillions under management
Avatar for Zesty.ai
Climate Risk Analytics Platform powered by Artificial Intelligence focused on Insurance
Avatar for Nom Nom
Real, good food for dogs & cats. Backed by science and made with love
Avatar for Vivun
The first product for presales designed to transform how you sell technology
Avatar for MCSquared Health
Simplifying medical bills so patients and providers can focus on care
Avatar for Nana
Nana is up-skilling the 10M people who will lose their jobs because of tech automation