Welcome

(Gen)AI Business Strategist

Projects and Skillset

My favorite set of acquired skills revolves around Multi-Agentic AI or Multi-Agent AI Systems. How do Multi-Agentic AI workflows operate? It involves designing and creating a team of specialized AI agents that work together to accomplish a defined goal. They use tools and real-time data to execute complex tasks as part of a multi-agent system. These agents work autonomously or collaboratively with humans (i.e. human-in-the-loop) to streamline operations and automate workflows, promising a new standard of efficiency, productivity, and innovation for business leaders.

Multi-Agent AI framework components include:

Hands-on projects developed so far in 2026 thanks to Andrew Ng of DeepLearning.AI and his network of pioneer Generative AI partners including Anthropic, CrewAI, Nvidia, Databricks, and Landing.AI:

Hands-on projects implementing an agentic workflow that combines layout detection with LLM-based reasoning; using Agentic Document Extraction (ADE), LandingAI’s framework for treating documents as visual objects and grounding extracted fields to specific regions on the page: Parsing tables, charts, and multi-column documents, extracting structured outputs like Markdown and JSON with visual grounding, and building and deploying production-ready document pipelines on AWS..

Hands-on projects with Claude.ai, Claude Code, the Claude API, and the Claude Agent SDK, and how skills fit alongside tools, MCP, and subagents in modern agentic systems: Turning general-purpose agents into specialists when needed, structuring skill folders and SKILL.md files for efficient context management, using pre-built Anthropic skills and create custom ones following best practices, combining skills with MCP and subagents for multi-step workflows, and building code generation, review, testing, and research agents with isolated context.

Hands-on projects building a climate data analysis agent with Nvidia's NeMo Agent Toolkit (NAT), then scaling it into a multi-agent workflow with professional-grade deployment. Adding observability with OpenTelemetry and Phoenix tracing to inspect agent reasoning and tool selection. Running systematic evaluations to catch bugs, measure improvements, and support CI/CD. Deploying with production features like authentication, caching, and rate limiting. Orchestrating multi-agent workflows that combine NAT agents with LangGraph, CrewAI, or custom Python agents. Building configuration-driven workflows (via YAML) and serve them as HTTP/WebSocket APIs or NAT UI.

Hands-on projects developed in 2025 thanks to Andrew Ng of DeepLearning.AI and his network of pioneer Generative AI partners including Meta, Anthropic, IBM Research, BeeAI, The Linux Foundation, LangChain, HuggingFace, Arize AI, Astronomer, Replit, LiveKit, RealAvatar.AI, ElevenLabs, CircleCI, Giskard, NexusFlow, AGI, Tavily, and Windsurf.AI:

Red-teaming to attack various chatbot applications to see how the system reacts to toxicity and offensive content, off-topic content, excessive agency, and sensitive information disclosure; techniques used to bypass safeguards include prompt injection, exploiting text completion, biased prompts, gray-box prompt attacks, prompt probing.
Implementing LLMs post-training pipelines: Downloading a pre-trained model from HuggingFace and post-train it using Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Online Reinforcement Learning (RL) to turn a base model into an instruct model, change the identity of a chat assistant, and improve a model’s math capabilities.
Building with Meta’s new Llama 4 multimodal models Maverick and Scout: 1. Building a 12-language translator chatbot; 2. Detecting objects, drawing bounding boxes, and turning a UI screenshot into executable code; 3. Asking questions over entire books, papers, or GitHub repos without chunking; 4. Improving system prompts automatically with the Prompt Optimization Tool; 5. Generating and curating training data using the Synthetic Data Kit.
Setting up agent communication protocols (ACP): Sequential and hierarchical workflows of agents (CrewAI, Smolagents) hosted inside ACP servers, then importing ACP-compliant agents into a registry to make them easy to discover and share across teams.
Configuring and testing an MCP server using FastMCP and MCP Inspector, adapting a Claude-powered chatbot into an MCP client that connects to both local and third-party servers, expose tools, resources, and prompt templates to an AI app, and connect an MCP server to applications like Claude Desktop and deploy it for remote access.
Transforming a RAG prototype into a robust, automated pipeline using Apache Airflow 3: Scheduling pipelines using both time-based and event-driven triggers, parallelizing tasks with dynamic task mapping, adding retries, alerts, and backfills to ensure reliability, scaling orchestration using real-world techniques from apps like Astronomer’s Ask Astro.
Building AI voice agents for production that listen, reason, and respond in real-time using speech-to-text, LLMs, and text-to-speech and optimizing latency.
Building a continuous integration comprehensive testing framework pipeline using LLMs to detect hallucinations in a quiz generation application.
Creating an AI assistant capable of performing tasks on a computer using multimodal prompting (combining text and images with streaming responses), prompt caching to reduce costs and latency, and tool-use workflows to integrate external tools effectively.
Building a deep-research agent with secure code execution using E2B, an open-source runtime for executing AI-generated code in secure cloud sandboxes, monitoring and evaluating agents.
Developing an autonomous web agent that scrapes, summarizes, fills forms, and carries out multi-step workflows using Monte Carlo Tree Search (MCTS), self-critique, and Direct Preference Optimization (DPO) to help self-correct.
Creating web apps with Replit's vibe-coding agents, hosting and sharing web applications: a website performance analyzer (SEO) and a voting app, using the AI assistant to debug and customize.
Creating AI agents that store, retrieve, and update memory: Building an AI-powered email assistant that can automatically route, respond to, or schedule emails, adapting its behavior based on past interactions.
Developing a chatbot to interface with private data and documents using retrieval augmented generation (RAG) and vector stores.
Composing and customizing chains and agents with LangChain.
Building a Q&A application with LangChain to return information on stored documents and using Wikipedia to answer questions.
Implementing agentic AI systems in LangChain and implementing using LangGraph.

Other hands-on projects developed in 2025:

An app/bot with CrewAI where a Research Analyst agent gathers cutting-edge information from real-time tools (web search), and a Content Strategist agent who rewrites that information into clear, engaging content for a tech-savvy audience.
An app/bot with CrewAI that helps you with meal planning, budgeting, and grocery shopping, by browsing the internet, calculating ingredient quantities, checking prices, organizing shopping lists that fit your budget and dietary needs.
An AI NourishBot App with CrewAI leveraging Meta's advanced multimodal model, Llama 3.2 90B Vision Instruct to handle both visual and textual data. NourishBot features a dietary team (AI multi-agent system) offering tailored nutritional guidance by identifying what's on your plate, giving you insightful nutritional details and actionable tips, offering dynamic, real-time advice based on dietary preferences, and suggesting recipes based on what's in your fridge.
Hands-on projects building multi-agent systems with BeeAI: Cybersecurity analysis, business planning, and travel coordination systems/bots
An AI app with AG2(AutoGen) where a teacher agent collaborates with a planner agent and a reviewer agent to create innovative school lesson plans
An AI app featuring a bug triage assistant with AG2 that classifies bug reports as either escalate (for example, critical crash or security issue), close (for example, minor cosmetic issue), and medium priority (default for others), in a human-in-the-loop workflow.
An AI AutoMed App/multi-agent AI chatbot with AG2 designed to simulate expert medical consultation through intelligent collaboration, mimicking the behavior of a real medical team, where different AI agents collaborate to analyze symptoms, suggest treatments, fetch real-time medical data, and provide follow-up care. AutoMed's specialized agents work together to deliver precise, tailored recommendations based on the user’s health history and real-time input
A Mental Health chatbot with AG2 that identifies emotions based on user input, and provides relaxation techniques and coping strategies.
DocChat, a production-ready multi-agent hybrid retrieval-augmented generation (RAG) system powered by Gradio, LangGraph, Meta models, and IBM watsonX AI. DocChat provides a user-friendly interface for uploading documents, submitting queries, and retrieving AI-generated answers along with verification reports.
Hands-on projects applying AI-powered storytelling, speech-to-text transcription, and text-to-speech synthesis to real-world applications, such as AI-generated audiobooks and automated meeting assistants. 
Designing and deploying multimodal AI applications using tools like OpenAI Whisper, Mixtral, Gradio and LangChain.
Hands-on projects applying state-of-the-art models like DALL·E and Sora to generate images and videos from text prompts, implementing an image captioning system using Meta’s Llama 4, combining vision and language models for real-world applications.
Multimodal retrieval and search, multimodal Question Answering (QA), and chatbots: cross-modal retrieval techniques enhancing search engines and recommendation systems. Hands-on projects: Creating an AI-powered personal shopping assistant that identifies, matches, and retrieves fashion items based on image inputs; using multimodal text-vision models to estimate calorie counts and provide dietary recommendations based on image and text inputs.

Hands-on projects developed in 2024 thanks to Andrew Ng of DeepLearning.AI and his network of pioneer Generative AI partners: Google Cloud, OpenAI, Meta, HuggingFace, Amazon Web Services (AWS), Qualcomm, Mistral AI, MongoDB, Guardrails.AI, Qdrant, Weaviate, Haystack, Pinecone, LlamaIndex, crewAI, Intel, WhyLabs, Upstage AI and many other key players:

Pretraining an LLM from data preparation to model configuration and assessment. Modifying existing models using Depth Upscaling to reduce training costs.
Building and deploying a sophisticated AI agent capable of handling real-world customer support scenarios, fully serverless, and ready to scale with Amazon Bedrock. Integrating tools, code execution, and guardrails to manage agentic actions effectively with safeguards to prevent malicious prompts and unintended outputs. Project example: Building a customer service bot for a tea mug business that can handle tasks like answering queries, retrieving information, and processing orders. Connect your customer service agent to services like a CRM to get customer details and log support tickets in real time.
Prompting and customizing LLM responses using Amazon Bedrock. Summarizing audio conversations by first transcribing an audio file and passing the transcription to an LLM. Deploying an event-driven audio summarizer that runs as new audio files are uploaded using a serverless architecture.
Deploying AI models on edge devices like smartphones using their local compute power for faster and more secure inference. Explore model conversion by converting PyTorch/TensorFlow models for device compatibility, and quantize them to achieve performance gains while reducing model size. Device integration including runtime dependencies.
Exploring Mistral 7B, Mixtral 8x7B, and Mixtral 8x22B (open source) and Small, Medium, and Large (commercial) to select the right model for a use case: Effective prompting techniques, function calling, JSON mode, and Retrieval Augmented Generation (RAG).
Building an AirBnB recommendation system. Setting up a MongoDB database, a vector database, and a query database using text and image embeddings, building a retrieval augmented generation (RAG) aggregation pipeline applying pre- and post-filtering. Projections, boosting, and prompt compression.
Hands-on multimodal prompting for tasks requiring advanced abstract and complex reasoning in practical applications like coding, vision tasks, and building workflows that balance intelligence and cost. Meta-prompting to optimize results.
Using Canvas’ side-by-side workspace to brainstorm, draft, and refine text and code with ChatGPT. Using tools for debugging, targeted editing, and adding final polish. Building practical use cases: creating game apps, generating Python code from plot screenshots, and designing SQL databases from architecture images.
Multimodal prompting with Llama for advanced image reasoning use cases such as understanding errors on a car dashboard, adding up the total of photographed restaurant receipts, grading written math homework. Function calling and custom tools with examples for web search and solving math equations.
Adding AI validations (guardrails) to a RAG-powered customer service chatbot, implementing techniques to validate and verify inputs and outputs, building custom protections including personal identifiable information detection, focused response controls, building name detection pipeline, entity recognition, checking for hallucinations and prompt injections using natural language inference (NLI).
Building faster and more relevant vector search for LLM applications.
Designing and executing real-world applications of vector databases including hybrid and multilingual searches.
Building applications including a retrieval augmented generation (RAG) app, a news summarization app, a chat agent with function calling and more.
Building six applications powered by vector databases, including semantic search, retrieval augmented generation (RAG), and anomaly detection.
Building smarter search and Retrieval Augmented Generation (RAG) applications for multimodal retrieval and generation using Weaviate vector databases, GPT-4 and Gemini Pro Vision.
Agentic generative AI applications that allow large language models to work with your data in any format, using GPT-3.5 Turbo.
Real world business use cases building advanced AI agentic workflows with Retrieval Augmented Generation (RAG). Practical applications include project planning, lead scoring, customer support analysis, and content creation at scale. Pipeline performance evaluation.
Large Vision Language Models (LVLM). Multimodal RAG (MMRAG) system architecture. Embeddings. Preprocessing videos for MMRAG. Multimodal retrieval from vector stores. Multimodal RAG with Multimodal Langchain. Using OpenAI's Whisper and Large Language and Vision Assistant (LlaVA).
Best practices for multimodal prompting and parameter control. Creating use cases with images. Developing use cases with videos i.e. “finding a needle in a haystack”. Integrating real-time data with function calling.
Building an end-to-end workflow application for LLMs. Design and automation steps to tune an LLM for a specific task and deploy it as a callable API. LLMOps best practices. Responsible AI by outputting safety scores on sub-categories of harmful content. Hands-on projects to adapt a supervised tuning pipeline to train and deploy a custom LLM acting as a question-answering Python coding expert.
Prompt engineering best practices for application development and building a custom chatbot.
Exploring real-world scenarios to evaluate the safety and security of LLM applications, protecting against potential risks like hallucinations, jailbreaks, and data leakage.
Creating demos of machine learning applications for image generation, captioning, and text summarization; sharing apps on Hugging Face Spaces.
Performing text, audio, image, and multimodal tasks using Hugging Face transformers and sharing AI apps using Gradio and Hugging Face Spaces. Turning a small language model into a chatbot, summarizing documents, converting audio to text with Automatic Speech Recognition (ASR), and text to audio using Text to Speech (TTS). Performing image captioning and segmentation. Deployment options. Hands-on project: Deploying an image captioning API on Hugging Face Spaces.
Hands-on programming projects building a Multi-Agentic AI framework for orchestrating role-playing, autonomous AI agents that collaborate as a team to solve business problems such as: research and write an article; implement customer support automation; design a customer outreach campaign; automate event planning; collaborate for financial analysis; tailor job applications.

The "House of Analytics"* below illustrates the scope of my quantitative skills (applied statistics & applied mathematics, the 4 pillars):

Skills acquired since 2024 related to Generative AI include prompt engineering* for text, image, code, speech, video, data generation/preparation/querying/augmentation, Gen-AI machine learning modeling, Gen-AI driven interactive dashboarding and storytelling. LLMs for text classification and sentiment detection, translation capabilities, code generation, text summarization and question-answering

*Prompt engineering: Generative artificial intelligence (AI) systems (like ChatGPT) are designed to generate specific outputs based on the quality of provided prompts or instructions received. Prompt engineering helps generative AI models better comprehend and respond to a wide range of queries, from the simple to the highly technical. The basic rule is that good prompts equal good results.

Have a look at some of my advanced statistical analyses on RPubs as well as Tableau for sample visualization projects.

How I got here

Passion for lifelong learning and innovation fuels my journey through the dynamic fields of applied statistics and AI, always with a focus on leveraging these tools for tangible business impact, so I actively engage with programs and training that connect business strategy to cutting-edge research and technology. My curiosity extends to operations research, data science, econometrics, and advanced quantitative methods—all vital for tackling complex business challenges.

My adventure began in 2016 with a transformative encounter: MIT's "The Analytics Edge" course on edX, brillantly delivered by Professor Dimitris Bertsimas, Massachusetts Institute of Technology, Sloan School of Management. This experience marked a pivotal moment as I realized the immense potential of merging both the MBA mindset and quantitative/AI prowess within a single individual, without any intermediaries. His approach ignited my dedication to combining Artificial Intelligence within business strategy. Since then, I've been captivated by the relentless pursuit of knowledge in the dynamic landscape of Artificial Intelligence—an interdisciplinary arena that constantly challenges and inspires. I've become somewhat addicted to the exhilarating journey of upskilling, recognizing that in this ever-evolving field, the learning never ceases. Join me on this exhilarating journey, and let's unlock the potential of AI together!

Inspired by sharp minds, folks with a growth-mentality, and true competence (hint: work experience (i.e. a work certificate) does NOT necessarily mean someone is competent), I chose two quotes from statisticians who endeavor to approach everything they do thoroughly - and do it right:

W. Edwards Deming , an American engineer, statistician, professor, author, lecturer, and management consultant, pointed out that:

To emphasize the importance of striving to really understand what you're doing, Prof. Russ Lenth – Department of Statistics and Actuarial Science, University of Iowa - puts it nicely in this enlightening, yet quite provoking example:

Prof. Lenth's comment on the difficulty of statistics