Do something Great

At Vietnam Silicon, we are on a mission to innovate and create world-class technology solutions.

  1. Home
  2. Career
  3. Principal Data Scientist

Principal Data Scientist

Ho Chi Minh Office
Full-time
Department: Technical
Expiration Date: 2025-12-31
Share this job

Job Brief

We are seeking a highly skilled and experienced Principal Data Scientist with deep expertise in embedding technologies, deep learning architecture, and language model deployment. The ideal candidate will drive the development of our AI capabilities, lead technical innovation initiatives, and collaborate with cross-functional teams to deliver transformative AI solutions for our clients across Southeast Asia.


Responsibilities

  1. Design and implement advanced search systems utilizing embeddings, hybrid search, cross-encoders, and reranking techniques
  2. Architect, build, and fine-tune deep learning models from scratch, developing custom loss functions for specific use cases
  3. Create, organize, and manage high-quality datasets for model training and evaluation
  4. Deploy and optimize Large Language Models (LLMs) and Vision-Language Models (VLLMs) in production environments
  5. Develop interactive dashboards using Streamlit to visualize model performance and business outcomes
  6. Conduct comprehensive evaluations of model performance with rigorous metrics and testing methodologies
  7. Mentor junior team members and contribute to Vietnam Silicon's technical leadership in the region
  8. Collaborate with business stakeholders to translate requirements into effective AI solutions


Requirements

Must have

  1. Experienced on training/finetune LLM (LLAMA3, QWEN, Mistral,…)
  2. Design database to do information retrieval with big data (GB/TB of text)
  3. Design data pipelines to clean data, data quality to build big knowledge base of chatbot
  4. 10+ years of professional experience in NLP, LLM
  5. Proven experience with embedding technologies, vector search, and reranking systems
  6. Demonstrated ability to design deep learning architectures and implement custom loss functions
  7. Strong experience in dataset design, creation, and management for AI applications
  8. Proficiency in deploying and optimizing LLMs and VLLMs in production environments
  9. Experience building interactive dashboards with Streamlit or similar tools
  10. Advanced Python skills and expertise with major ML frameworks (PyTorch, TensorFlow, Hugging Face)
  11. Excellent problem-solving and communication skills
  12. Ability to work collaboratively with cross-functional teams


Prefer

  1. Experience with GPU cluster management for distributed AI training
  2. Expertise in fine-tuning large language models for specific domains
  3. Familiarity with Southeast Asian languages and regional business contexts
  4. Contributions to open-source ML/AI projects or research publications
  5. Experience implementing AI solutions in enterprise environments
  6. Knowledge of MLOps practices and tools for model deployment and monitoring
Recruitment Process
1
Application Review
2
Initial Conversation
3
Technical Interview
4
Filnal Discussion
5
Offer & Welcome
Please upload your Resume
Select relevant documents to upload your Resume
You are applying for
Contact Details