Senior Gen AI Engineer

Excellarate

Software Engineering, Data Science
Pune, Maharashtra, India
Posted on Wednesday, September 11, 2024

Description

Role – Senior GenAI Engineer

Job Description:

We are looking for a talented Generative AI Engineer with expertise in integrating large language models (LLMs) via APIs (Azure OpenAI or AWS Bedrock), proficiency in Python, and experience with an orchestration framework such as LangChain or LlamaIndex. The ideal candidate will develop and optimize applications that use generative AI, prompt engineering, retrieval-augmented generation (RAG), and LLM integration.

Key Responsibilities:

  • Design, develop, and implement LLM-based solutions using Azure OpenAI, AWS Bedrock, or other APIs.
  • Integrate LLM APIs for various applications, including chatbots, virtual assistants, and custom NLP solutions.
  • Employ prompt engineering techniques to optimize model performance and output.
  • Implement retrieval-augmented generation (RAG) models to enhance LLM response accuracy.
  • Develop LLM workflows and pipelines using frameworks such as LangChain and LlamaIndex for seamless integrations.
  • Collaborate with cross-functional teams to define requirements and deliver AI-powered solutions.
  • Debug, test, and optimize existing LLM integrations to ensure consistent performance.
  • Stay up-to-date with the latest advancements in LLMs, prompt engineering, and generative AI technologies.

Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related fields.
  • Total experience: 5-8 years.
  • 3+ years of hands-on experience working with Python in AI and NLP-based applications.
  • Proven experience in integrating LLM APIs (Azure OpenAI, AWS Bedrock, or similar).
  • Strong understanding of prompt engineering and techniques to optimize LLM behavior.
  • Expertise in orchestration frameworks such as LangChain or LlamaIndex for LLM application development.
  • Experience with retrieval-augmented generation (RAG) models and integrating them into real-world applications.
  • Solid understanding of NLP principles and LLM-based solutions.

Preferred Skills:

  • Experience with cloud platforms such as Azure, AWS, or Google Cloud.
  • Proficiency in API integration and handling LLM services at scale.
  • Familiarity with vector databases and semantic search techniques.
  • Strong problem-solving skills and ability to work in agile teams.
  • Excellent communication and collaboration skills, able to work with both technical and non-technical stakeholders.