NLP Engineer/Data Scientist

$65k – $125k • 0.5% – 0.75%
Build our NLP capabilities for mining unstructured and quasi-structured data.
Research and evaluate new/different approaches to NLP problems.
Produce deliverable results and take them from development to production in collaboration with our engineers.

-Minimum three years of commercial production experience.
-Must be fluent in both written and spoken English.

Expertise in the following: Custom Named Entity Extraction, Document Classification, Topic Modeling, Tabular Data Extraction.

Experience with noisy and/or unstructured textual data, and with traditional extraction techniques, such as regular expressions and tabular data extraction, coupled with NLP.

Strong understanding of text pre-processing and normalization techniques, such as tokenization, POS tagging and BILOU tagging, and of how they work at a low level when training a neural network.

Strong knowledge of Python and R, and general software development skills (source code management, debugging, testing, deployment, etc.).
Strong knowledge of Docker and orchestrators such as Kubernetes.
Strong understanding of training neural networks, such as CNNs, for custom entity recognition.
Strong knowledge of spaCy and other NLP libraries.

Expertise in guiding annotation and in producing, processing, evaluating and utilizing training data.

MSc/PhD in Computer Science, Computational Linguistics or a related field.

Strong interest in, and knowledge of, Artificial Intelligence and its subfields.
Experience with Deep Learning and Word Embeddings.
Experience with open-source NLP toolkits such as CoreNLP, OpenNLP and NLTK.
Experience with open-source ML/math toolkits such as scikit-learn, MLlib, Theano, NumPy, etc.
