Job Overview

AI Engineer – LLM/NLP (Hybrid Working)

Vexere: Revolutionizing Travel and Transportation in Vietnam
We are a Vietnamese technology company aiming to revolutionize the travel and transportation industry. As the largest online bus ticketing platform in Vietnam, we help millions of travelers make unforgettable journeys. Recently we expanded to airlines, trains, and vehicle rentals in order to provide our customers with the best options for their travel plans.
Vexere’s Culture:
Vexere members believe our organization exists to solve pressing societal problems. Our company culture emphasizes community spirit, mutual support, and open communication. We encourage all members to express themselves, contribute ideas on products and strategies, and strive to become future leaders.
Vexere’s Vision:
Vexere aspires to be a constantly growing and innovative business. We foster a strong culture of learning and development, aiming to leverage our technological strengths to continue revolutionizing the transportation and tourism sectors across Southeast Asia.

We are rebuilding every customer and employee touch-point as an AI-native experience. That ranges from booking a flight, bus, or train seat to automating sales processes and leave requests. Engineers here get clear goals, full context, and the freedom to ship quickly while keeping reliability and safety paramount.

Responsibilities:

Major projects you will tackle

  • Omni-channel customer chatbot that handles FAQs, after-sales, and ticket booking for bus, flight, train, and vehicle-rental verticals—across text and voice.
  • Department-specific assistants for HR onboarding, finance invoice queries, operations incident triage, IT help-desk, and more—each powered by shared LLM components and n8n triggers.
  • Company-wide automation hub built on n8n, including reusable nodes and guard-rails that let non-technical teams create flows safely.
  • Multimodal expansion that blends text, speech, and images so customers can, for example, upload a ticket photo or speak a booking change request.

Across these projects, you will fine-tune foundation models, fuse retrieval with LLM reasoning, and iterate in Vietnamese, English, and other languages.

What you will do

  • Design, implement, and continually improve our multi-agent framework. Build and refine agent components such as short- and long-term memory stores, planning/reflection loops, agent-to-agent messaging, and specialised prompt templates that let multiple agents collaborate on complex tasks.
  • Select and adapt models such as OpenAI, Gemini, Llama, Mixtral, or better, using LoRA or full fine-tuning, so the models speak our brand voice in multiple languages.
  • Build and optimise retrieval-augmented pipelines, keeping latency below two seconds and hallucinations under five percent.
  • Craft prompts, refusal rules, and jailbreak tests; automate factuality, safety, and multimodal hallucination checks in CI.
  • Package models with Docker and serve them through FastAPI or gRPC endpoints backed by vLLM, Triton Inference Server, or Text Generation Inference; add monitoring with Grafana/Prometheus for latency, drift, and GPU cost (for later milestones).
  • Pair daily with Conversation Designers, MLOps engineers, Automation engineers, and business stakeholders; publish model cards, data sheets, and rollback plans.

Requirements: 

Must-have qualifications

  • 1+ years of experience shipping NLP or generative-AI systems to production.
  • Strong Python and PyTorch skills plus heavy use of Hugging Face (Transformers, PEFT, Datasets).
  • Practical knowledge of vector databases such as Milvus, pgvector, Qdrant, or Pinecone, and of hybrid-search techniques.
  • Demonstrated ability to raise chatbot containment rates or reduce hallucinations through data-driven experiments.
  • Solid engineering habits: Git, code reviews, unit and integration tests, CI/CD pipelines.
  • Clear spoken and written English; Vietnamese fluency is a bonus.

Nice-to-haves

  • Hands-on experience with Rasa, Dialogflow CX, or ASR/TTS pipelines for voice bots.
  • Deep GPU-performance tuning, including quantisation, KV-cache optimisation, or custom Triton kernels.
  • Familiarity with multi-agent frameworks such as PydanticAI, CrewAI, or LangGraph.
  • A privacy-by-design mindset and working knowledge of GDPR or PDPA compliance.

Benefits: 

  • Competitive salary + KPI-based quarterly bonuses.
  • Hybrid working.
  • Clear career progression with opportunities for advancement to key positions based on your capabilities.
  • Special discounted bus tickets for employees and their families.
  • Comprehensive periodic health check-up policies.
  • Participation in social insurance, health insurance, and unemployment insurance according to Vietnamese labor law after the probationary period.
  • 12 days of annual leave, with an additional day added every 3 years (convertible to salary).
  • Vibrant and dynamic working environment with a friendly and supportive team that shares knowledge and assists each other.
  • Training and development opportunities in negotiation, communication, work management, interpersonal skills, and software technology.
  • Free parking, plus allowances for marriage, newborn babies, and other occasions.
  • A spacious pantry fully equipped with a coffee maker, microwave, milk, tea, and more.
  • A wide range of sports and social activities: badminton, pickleball, football, etc.
  • Other perks to be discussed during the interview.

For more information, please contact us via:

  • Send your résumé by email to [email protected] with the subject line: Full name – Applied Position
  • Phone/Zalo: 039 555 54 86 (Mr. Nhân)

Office Location: Vexere Trading and Services Co., Ltd. – 2nd Floor – Building H3, 384 Hoang Dieu, Ward 6, District 4, HCMC

Working Hours: 8:30 am – 6:00 pm from Monday to Friday, and Saturday morning.
