We are building human-driven NLP technologies and are seeking an experienced, proactive data scientist to join our newly formed engineering team. An ideal candidate will have a strong background in natural language processing methods and experience with a range of text classification problems. We have a particular interest in event extraction/detection, entity recognition, slot-filling tasks, document classification, and zero-/few-shot learning. Secondary experience in forecasting, time series, and/or active learning methods will also be beneficial. As this is a new team, we are looking for candidates who are willing to help grow the organization by taking on a range of responsibilities across the technical spectrum, as well as effectively collaborate to deliver findings to our customers.
The position may require some on-site work in Northern Virginia for team and client meetings.
Responsibilities Your day-to-day will include:
Research and develop machine learning methods for a wide range of text extraction and classification tasks, often with limited labels and/or in multiple languages
Design and implement tools for monitoring and forecasting trends in signals derived from text data
Work effectively, in an often self-directed environment, to estimate timelines, communicate progress, and identify avenues for future research and development
Perform analyses and generate detailed data products for internal stakeholders and external clients
Institute MLOps principles in our software development practices and platform development
Deliver version-controlled, documented, and reproducible analyses and experiments that can be readily transitioned into scalable inference services
Work Experience and Skills
Advanced degree in computer science, math/statistics, engineering, linguistics, social science, or a related field
5+ years of experience in the data science field (this is flexible depending on academic work)
Proficiency with major Python data science libraries, including the SciPy stack and Scikit-learn
Experience with at least one deep learning and/or NLP framework (Tensorflow, PyTorch, Transformers, etc.)
Knowledge and understanding of pre-trained language models like BERT and GPT
Familiarity with other commonly used technologies including Linux operating systems, SQL/NoSQL databases, etc.
Ability to use git, as well as other version, workflow, and project management tools and technologies
Possess strong communications skills, with the ability to communicate complex ideas clearly and concisely to a range of audiences
Aptitude for learning quickly and a willingness to take on a wide range of responsibilities
Experience with other technologies and platforms in our stack, including: Elasticsearch, Kibana, Docker, DVC, Kubernetes, GCP, GitLab
Prior work in the marketing/communications and/or defense sectors
Ability to obtain and/or maintain a US government security clearance
Please mention that you found the job at Jobhunt.ai
Other machine learning jobs that might be interesting
Statistics and Computer Science Specialist - Hawk-Research Worldwide, 100% Remote - Salary: 12000-30000
Hawk-Research is looking for Statistics expert to provide assistance on our projects in academic research sphere. We are building a knowledge sharing platform to help people during their studies, so t...
Lead Machine Learning Engineer - Spectrum Labs(July 2022) Remote US/Canada, 100% Remote
Are you well-versed in and excited by the unique challenges of applying NLP and NLU to content moderation? Do you regularly leverage deep learning frameworks such as Tensorflow, PyTorch, and ONNX to b...Machine Learning Manager - Kalepa(July 2022) Remote US, 100% Remote - Salary: $200k – $300k
Kalepa is a rapidly growing, well-financed Series A start-up re-imagining the trillion-dollar commercial insurance industry. Our cross-functional teams leverage AI, cutting-edge tech and tons of data ...
Senior AI Research Scientist - Earth Species Project(August 2022) Worldwide, 100% Remote
The Earth Species Project (ESP) is a nonprofit organization dedicated to decoding animal communication and translating non-human language.
ESP partners with biologists and machine learning researche...