Sai Pavan
Gollapalli Naga Satya Sai Pavan

AI/ML Engineer

Overview

Software Developer @Sify Technologies Limited

Hyderabad, Telangana, India

he/him

About

Hello! I am Sai Pavan, an energetic and innovative Generative AI Engineer with 2 years of experience specializing in AI/ML frameworks, MLOps, containerization (Docker), orchestration (Kubernetes), and cloud platforms (AWS, GCP). I aim to apply my technical skills and creativity to developing state-of-the-art AI solutions within a dynamic team setting.

Let's connect and collaborate!

Experience

Sify Technologies Limited

Current Employer
  • Developed and deployed AI/ML solutions, focusing on data processing, predictive analytics, and automation.

  • Collaborated with cross-functional teams to integrate AI models into scalable enterprise-level solutions, improving operational efficiency and decision-making.

  • Conducted advanced research and prototyping to integrate and scale AI solutions into learning management systems and standalone applications.

  • Customized and deployed open-source AI models on private cloud infrastructure for scalable inference, using custom scripts, Xinference, Ollama, NVIDIA TensorRT-LLM, and vLLM.

  • Python
  • Docker
  • Kubernetes
  • AWS
  • GCP
  • Dify
  • RAG
  • Gen AI
  • Socket.IO
  • WebRTC
  • Streamlit
  • Selenium
  • ComfyUI
  • TensorFlow
  • PyTorch
  • Pipecat
  • Daily
  • Open-source and proprietary LLMs
  • Multi-models
  • LlamaIndex
  • Gradio
  • Flask
  • FastAPI
  • LangChain
  • LangGraph
  • Langfuse
  • Gained hands-on experience with Generative AI technologies, tools, and frameworks, including model inference and deployment.

  • Developed hands-on proficiency with AWS cloud and subsequently cleared the AWS Certified Developer – Associate certification through practical learning.

  • GenAI
  • LLMs
  • AI frameworks
  • AWS
  • AWS Certified Developer – Associate

Cisco Networking Academy

  • During this internship, I studied the fundamentals of computer networking and IP services, including network design, IP addressing, and routing protocols. The primary focus was building in-depth knowledge of Cisco’s routing and switching technologies, which are vital for building scalable and secure networks.

  • Networking & Routing Configuration: Gained hands-on experience in configuring and troubleshooting IP routing protocols, while developing a solid understanding of the OSI and TCP/IP networking models and of subnetting for efficient IP address management.

  • Switching & Network Services: Configured VLANs and Ethernet switching to ensure optimal network performance, and implemented essential IP services such as DHCP, DNS, and NAT to support dynamic address allocation, domain name resolution, and address translation across the network.

  • Networking
  • Routing
  • Switching
  • CCNA

Education

  • Bachelor of Engineering in Computer Science; CGPA: 8.70/10.0
  • Diploma in Computer Engineering; Percentage: 89.69/100
  • Board of Secondary Education; CGPA: 9.7/10.0

Projects (5)

  • Search Engine - Architected a cloud-agnostic solution combining open-source and commercial components, leveraging vector databases and GCP Vertex AI services. Also includes an Elasticsearch sandbox POC.
  • Implemented data enrichment pipelines for metadata tagging, data refinement, spell check, missing-text interpolation, and asset captioning and descriptions to enhance recall and content quality across text and image assets.
  • Developed custom celebrity recognition using Histogram of Oriented Gradients (HOG) features that automatically identifies and tags individuals in new images, with support for adding new celebrities, retraining with curated labels, and thresholded inference for precision.
  • GCP
  • Vertex AI
  • NLP
  • Histogram of Oriented Gradients (HOG)
  • Google Vision API
  • Elasticsearch
  • Firestore
  • Embedding Models
  • Vector Databases
  • Open-source and proprietary LLMs
  • Knowledge Graphs
  • Developed an advanced Conversational AI system with a virtual avatar capable of mimicking human voice, expressions, and emotions, delivering highly interactive, personalized, and context-aware user experiences. Integrated Retrieval-Augmented Generation (RAG) and search capabilities to provide dynamic, immersive, and emotion-driven interactions across various scenarios with both visual and verbal feeds.
  • Built in three versions: Unity WebGL for integration with SkillFLO, Unreal Engine for an immersive Experience Center demo, and Unity EXE for DevLearn Conference presentations.
  • Demos: NORA (Next-gen Operational Response Agent), presented at DevLearn, and PUJA (Personal Universal Job Assistant), designed for the Experience Center as a scenario-driven avatar.
  • Histogram of Oriented Gradients (HOG)
  • Retrieval-Augmented Generation (RAG)
  • Gen AI
  • Real-Time Communication (RTC)
  • Socket.IO
  • Python
  • FastAPI
  • Integration and deployment of Gen AI models
  • Content Automation pipelines
  • Backend Development
  • Architected an agentic AI workflow using LangGraph (Python) to automate ticket resolution within Sify’s cloud management ecosystem, introducing intelligent state-based decision-making.
  • Developed a proof of concept for automated infrastructure provisioning on AWS using MCP (Model Context Protocol) servers provided by AWS, significantly streamlining and accelerating ticket handling workflows.
  • Creating MCPs like an SSH server and a code generator (generator, debugger, executor) to streamline communication between the LLM and Sify’s cloud infrastructure.
  • Building a fully autonomous ticket management system leveraging agentic states and dynamic nodes to process, diagnose, and resolve incidents — escalating to human intervention only when necessary.
  • Implemented multiple MCP servers to enable seamless communication between AI agents and Sify’s cloud infrastructure, ensuring reliable orchestration and data exchange across environments.
  • Architected a serverless generative AI platform for e-learning using LlamaIndex, LangChain, and AWS, enabling semantic search, automated assessment generation, storyboard creation, and AI-enhanced media libraries.
  • Developed and integrated AI-driven modules for automated content creation (HTML, video, storyboards) and assessment generation from multiple sources using NLP techniques.
  • Engineered AI-powered analytics for custom report generation via text prompts with visual insights, and implemented an AI chatbot to enhance content discoverability and user access to learning resources.
  • Python
  • REST APIs
  • Gen AI Models Integration and deployment
  • Content Automation pipelines
  • Backend Development
  • Prompt Engineering
  • Developed an AI-powered interactive assistant for children (ages 4-12) using GPT-4o, Python, and FastRTC, delivering real-time bedtime stories, homework help, and engaging conversations.
  • Built a safe, age-appropriate interface with dynamic multi-agent context switching and adaptive storytelling for personalized learning experiences.
  • Introduced imaginative roleplay and AI-generated quizzes, fostering creativity, engagement, and knowledge retention through immersive interactions.
  • GPT-4o
  • Python
  • Multi-Agent Systems
  • Real-time AI
  • WebSockets
  • LSTM (Long Short-Term Memory)

Honors & Awards (2)

Certifications (7)

AWS Certified AI Practitioner
Issued by: Amazon Web Services (AWS)

GCP Certified Generative AI Leader
Issued by: Google Cloud Platform (GCP)

AWS Certified Developer - Associate
Issued by: Amazon Web Services (AWS)

Oracle Cloud Infrastructure 2025 Certified AI Foundations Associate
Issued by: Oracle

Oracle Cloud Infrastructure 2025 Certified Generative AI Professional
Issued by: Oracle

PCAP: Programming Essentials in Python
Issued by: Cisco

Getting Started with Competitive Programming
Issued by: NPTEL