Skip to content

JobShark: Find the Right Job

 

Palo Alto, CA Full Time Posted by: BrightAI Posted: 27/01/2026 19:36:40
 
 

Bright.AI is a high-growth Physical AI company transforming how businesses interact with the physical world through intelligent automation. Our AI platform processes visual, spatial, and temporal data from billions of real-world events-captured across edge devices, mobile sensors, and cloud infrastructure-to enable intelligent decision-making at scale.

We are now hiring a Senior AI Engineer - LLM, RAG to lead the development of Retrieval-Augmented Generation (RAG) systems that harness the power of large language models (LLMs) and real-world knowledge sources. This role is pivotal to building next-generation intelligent assistants that help technicians and operators troubleshoot complex issues in industrial settings.

You'll work at the intersection of NLP, foundational models, and Real Time information systems-developing intelligent tools that turn manuals, technician notes, and sensor data into actionable, conversational guidance for the physical world.

Responsibilities
  • Lead the architecture and development of RAG systems that combine LLMs (eg, LLAMA, Mistral, Claude, GPT) with structured and unstructured external information sources.
  • Develop AI-powered assistants to support technicians in diagnosing and resolving anomalies or failures in factory, plant, or industrial settings.
  • Build pipelines to ingest, preprocess, and index large corpora of documents (manuals, logs, notes, procedures) for semantic search and grounding.
  • Customize and fine-tune foundational models to incorporate domain-specific language, tone, and logic for industrial troubleshooting scenarios.
  • Collaborate with product, data, and cloud teams to design scalable, privacy-compliant, and latency-sensitive LLM applications.
  • Design evaluation strategies to measure performance, accuracy, and user experience of RAG-enabled systems in production settings.
  • Stay up to date with the latest advances in LLM architectures, retrieval methods, and prompt engineering, and integrate emerging techniques into the product roadmap.
Educational Background
  • M.S. or Ph.D. in Computer Science, AI, Machine Learning, or a related field, with specialization in NLP or deep learning.
  • Strong research or applied background in large language models (LLMs) and retrieval-augmented generation (RAG) systems. Agentic RAG experience is highly desirable.
Required Skills & Expertise
  • 5+ years of experience in machine learning or AI with a strong focus on NLP, LLMs, or conversational AI.
  • Fluency with modern LLMs and open-source foundational models (eg, LLAMA, Falcon, Mistral, GPT, Claude).
  • Experience building RAG pipelines with tools like LangChain, LlamaIndex, or custom vector database integrations, with at least one production-grade system built.
  • Fluency with prompt engineering, instruction tuning, or fine-tuning open-source models.
  • Deep understanding of document retrieval (semantic search, embedding generation, similarity metrics) and vector stores (eg, FAISS, Weaviate, Pinecone).
  • Strong foundation in core machine learning techniques, including experience with reinforcement learning (RL) or decision-making models.
  • Proficiency with ML development frameworks such as PyTorch, Hugging Face Transformers, or similar. Strong Python programming is a must.
  • Experience integrating AI systems into real-world applications with user-facing interfaces and operational constraints.
  • Excellent problem-solving and critical thinking skills; ability to design solutions for complex, ambiguous problems.
  • Strong written and verbal communication skills, with ability to collaborate cross-functionally with engineers, product managers, and domain experts.
Bonus Qualifications
  • Experience applying LLMs in industrial or physical infrastructure settings (eg, manufacturing, logistics, utilities, energy).
  • Knowledge of industrial control systems, maintenance workflows, or technician support processes.
  • Exposure to multimodal models or integrating textual data with sensor and/or time-series data.
  • Prior experience in a startup or a fast-paced environment building LLM-powered products from the ground up.
Palo Alto, CA, United States of America
Engineering
BrightAI
Click apply
JS26489_25303_0E03563DBEE170FC1E3641E20FD0E1D8
27/01/2026 19:36:40
We strongly recommend that you should never provide your bank account details to an advertiser during the job application process. Should you receive a request of this nature please contact support giving the advertiser's name and job reference.