AI Engineer Data Scientist (NLP & Cybersecurity)
New Yesterday
Salary:
Position Overview
We are seeking an AI Engineer Data Scientist with strong expertise in Natural Language Processing (NLP), predictive modeling, and data engineering. The ideal candidate will develop and implement machine learning models that process large-scale unstructured and structured text data in the cybersecurity domain. This position will be based in our Tysons Corner, Virginia office, offering free parking and within 10 minutes of the Metro station.
Main Responsibilities
NLP & Predictive Modeling
Design, develop, and deploy NLP pipelines for extracting, processing, and analyzing large-scale cybersecurity-related text data (e.g., threat reports, logs, vulnerability disclosures).
Build and optimize predictive models to identify, classify, and forecast cybersecurity risks and trends.
Implement advanced algorithms such as named entity recognition (NER), topic modeling, sentiment analysis, and text summarization.
Data Engineering & Data Science
Develop data ingestion and transformation workflows from multiple structured and unstructured data sources.
Design and maintain scalable data pipelines and data lakes to support analytics and model training.
Conduct exploratory data analysis (EDA) to identify patterns, anomalies, and actionable insights.
Perform feature engineering to enhance model accuracy and relevance.
Cybersecurity Application
Leverage NLP and AI to detect, analyze, and predict cyber threats, vulnerabilities, and attack patterns.
Collaborate with cybersecurity analysts to validate model outputs and integrate AI-driven insights into security workflows.
Visualization & Reporting
Create interactive dashboards and visualizations (Power BI, Tableau, or similar) to communicate findings to technical and non-technical stakeholders.
Prepare and present analytical reports summarizing methods, findings, and recommendations.
Qualifications Required:
Bachelors or Masters degree in Artificial Intelligence, Data Science, Computer Science, or related field.
Strong proficiency in Python and libraries such as spaCy, NLTK, Hugging Face Transformers, TensorFlow, or PyTorch.
8-10 years of experience building, training, and deploying NLP models in production environments.
Strong knowledge of data engineering concepts: ETL processes, SQL/NoSQL databases, and data pipeline tools (e.g., Apache Spark, Airflow).
Solid understanding of machine learning methods and predictive modeling.
Excellent analytical, problem-solving, and communication skills.
Need to be a US citizen.
Preferred:
Experience working with cybersecurity datasets (e.g., CVE data, threat intelligence feeds, log analysis).
Familiarity with cybersecurity frameworks (e.g., MITRE ATT&CK, NIST).
Experience with cloud platforms (AWS, Azure, or GCP).
Familiarity with containerization (Docker) and CI/CD for ML deployment.
Benefits & Perks
Competitive salary based on experience and qualifications.
Medical, dental, and vision insurance for employees and dependents.
401(k) with employer match.
Paid time off (PTO) and holidays.
Short-term and long-term disability coverage and Life insurance.
Free parking and close access to public transit.
It is EnDynas policy to promote equal employment opportunities. All qualified applicants will receive consideration for employment without regard to sex, race, color, ethnicity, age, national origin, citizenship, religion, physical or mental disability, medical condition, genetic information, pregnancy, family structure, marital status, ancestry, domestic partner status, sexual orientation, gender identity or expression, veteran or military status, or any other basis.
- Location:
- Mclean
- Category:
- Technology