Background
Logo of NorthEastern.dev
NorthEastern.dev
Data Scientist

Qode

fulltime

Posted on: 6/12/2025

Required Skills:

Pythondeep learningNLP

Job Description:

Key Responsibilities

  • Build and optimize classification, regression, and forecasting models using classical ML and deep learning techniques.
  • Develop and deploy deep learning architectures including LSTMs, transformers, and other sequence-based models for time-series, NLP, and anomaly detection.
  • Design and implement NLP pipelines for text classification, semantic search, summarization, and question answering using transformer-based models (e.g., BERT, T5, GPT).
  • Create RAG (retrieval-augmented generation) pipelines integrating LLMs with vector databases (e.g., FAISS, Pinecone, Weaviate) and document indexing frameworks.
  • Apply and fine-tune LLMs (e.g., OpenAI, Mistral, LLaMA, Cohere) for domain-specific tasks using supervised fine-tuning or LoRA/QLoRA methods.
  • Build and orchestrate multi-agent AI systems using frameworks like LangGraph, CrewAI, or OpenAgents to support tool-using, autonomous agents for decision-making workflows.
  • Collaborate with data engineers, product managers, and stakeholders to translate business needs into production-ready solutions.
  • Mentor and support junior data scientists through code reviews, model design feedback, and collaborative experimentation.
  • Promote best practices in reproducible modeling, responsible AI, and scalable deployment.



Required Skills & Experience

  • 5+ years of experience in data science or applied machine learning, with a strong background in both classical and deep learning methods.
  • Hands-on experience with Python, and libraries/frameworks such as scikit-learn, pandas, PyTorch, TensorFlow, Hugging Face Transformers, and LangChain.
  • Strong understanding of classification metrics, feature engineering, model validation, and hyperparameter tuning.
  • Demonstrated experience with LLMs, including fine-tuning, prompt engineering, and retrieval-augmented generation techniques.
  • Familiarity with vector databases, embedding models, and chunking strategies for unstructured data (e.g., PDFs, knowledge bases).
  • Experience working with multi-agent architectures or orchestration tools like LangGraph, CrewAI, or AutoGPT.
  • Solid skills in data analysis, visualization, and communicating technical insights clearly to mixed audiences.
  • Knowledge of cloud platforms (AWS/GCP/Azure) and deployment tools (e.g., Docker, MLflow, FastAPI).


Stay Updated with Job Alerts

Get the latest job openings delivered straight to your inbox.