Data Engineer / Machine Learning Engineer at Threadloom
🇺🇸 United States › California › Palo Alto (Posted Jul 1 2019)
About the company
We curate community content and tell contextual stories. Our curation services are used by 1,000+ forums reaching millions of people. We started in 2015 and have 2 locations – Palo Alto and Bellevue. Our team comes from a mix of startup and big tech backgrounds, but we all share a desire to build a better Internet. We are Stanford StartX alumni (2016).
Threadloom is looking for an experienced data engineer with strong machine learning experience.
This is a foundational role. You will be Threadloom's first engineer solely responsible for building and extending our processing pipelines. Working closely with Product, Ops and Eng it will be your job to design and develop the data warehouses used by all of our services and products. This includes ownership of the processing of billions of documents that power Threadloom Search and Newsletter and upcoming consumer products.
The ideal candidate is passionate about building large-scale, high-volume pipelines that manage and store mission-critical data. This person is conversant with current cloud platforms for parallel processing and storage, and can easily translate product and user requirements to data requirements for storing and managing data. They should also have experience with building machine learning models which classify and rank content and predict user preferences.
The ideal candidate also cares about our end users and is a careful steward of their data, so is also comfortable with modern user privacy standards and has experience applying them in real-world situations.
Skills & requirements
3+ years of relevant work experience
Launching consumer products that people love, at scale
Designing and implementing data pipelines and warehouses
Optimizing servers and pipelines to manage operational costs at scale
Building systems to handle user authentication and PII (e.g. Firebase, OAuth, GDPR)
Deploying production cloud services (e.g., Google Cloud, AWS, Azure)
Languages and tools
Python required, Scala/Java desired
Fluency with the latest tools, libraries, and infrastructure for building and maintaining production-level data pipelines and storage, including
distributed data processing frameworks (e.g., Hadoop, Spark, Flink, Apache Beam)
SQL and NoSQL databases (e.g., MySQL, Postgres, Cassandra, Redis)
stream processing frameworks (e.g., Kafka, Storm, Spark)
search engines (Elastic, Solr)
Built & launched ML models in a production environment
Scaling experimental models from proof-of-concept to live products that handle large-scale data
Comfortable building scalable backends, RESTful web services and APIs
Other machine learning jobs that might be interesting
Senior Machine Learning Engineer - Stoneridge (January 2021)
Novi, Michigan, United States
The Senior Machine Learning Engineer will develop state-of-the-art vision object detection and tracking algorithms based on Stoneridge’s reward winning product “MirrorEye”. Machine learning and deep learning are the major tools for the perception algorit...
Machine Learning Engineer - TransRe (January 2021)
NYC, New York, United States
This role will be part of our Applied Data Team and will be responsible for providing Machine Learning Engineering support.
Tasks & responsibilities required of this role include but are not limited to:
• Construct machine learning models including data co...
Machine Learning Engineer - SoFi (January 2021)
San Francisco, California, United States
Staff Data Scientist (Machine Learning Engineer)
San Francisco, California
The Invest Data Science team is looking to add data scientists / ML engineers (combined roles), who will help shape Invest product develo...
NLP Data Scientist - Aisera (January 2021)
Palo Alto, California, United States
AI / ML Data Science · Palo Alto, California
There are many examples of disruption in the consumer space – Uber disrupting the cab industry, Airbnb disrupting the hospitality industry and so on; but have you wondered who is disrupting support and operations...
Machine Learning Engineer Intern - Mailchimp (January 2021)
Atlanta, Georgia, United States
The Mailchimp internship program started in 2013. In the past 7 years, we’ve had the privilege of bringing in some of the brightest, most talented college students from around the country to spend time with our teams. Our 12 week internship program was design...
Not the machine learning job you are looking for?
Browse all machine learning jobs
and we're sure you will find a suitable one!