Barbara Wu | AI Engineer & Open Source Contributor

Hi, my name is

Barbara Wu.

AI engineer with 7+ merged PRs into LangChain, Haystack, and CAMEL. I turn LLMs into production systems — RAG pipelines, intelligent agents, AI-powered workflows — on frameworks I know from the inside out. Trilingual (Chinese, French, English) with a PhD from Sorbonne University. Trained to think in systems, driven by curiosity to understand why — whether it's an LLM's reasoning or my own.

7+PRs Merged to Major OSS

3Languages (CN/FR/EN)

PhDSorbonne University

AI/MLPythonLangGraphLangChainRAGChromaDB

FrontendReactNext.jsTypeScriptTailwind CSS

BackendDjangoFastAPIPostgreSQLCelery

DevOpsDockerPrometheusGrafanaGitHub Actions

View My Work Get In Touch

Open to work — Based in Paris, available internationally

Ask My AI

LIVE

I'm an AI trained on my resume, projects, and experience. Ask me anything!

Projects

Full-Stack RAG Conversational AI

RAGTalk

Real-time conversations with historical figures grounded in primary sources. Agentic query routing, hybrid BM25+vector search, Reciprocal Rank Fusion, cross-encoder reranking, and SSE streaming.

Agentic RAG routerHybrid search + rerankingLLM-as-Judge evaluation

Next.jsFastAPIChromaDBTailwind CSSOpenRouter

Source Code

Multi-Agent Research Assistant

AI Researcher

AI-powered research assistant with LangGraph orchestration. Self-Refine, Reflexion, Corrective RAG, and Multi-Agent Debate patterns for deep research synthesis.

4 advanced RAG patternsMulti-agent debateFull-stack architecture

PythonLangGraphFastAPIChromaDBNext.js

Source Code

Enterprise Support Platform

IT Ticketing System

Internal IT support platform with AI-powered ticket analysis, async task processing, 4-language internationalization (CN/EN/FR/NL), and full observability stack.

4-language i18nAI ticket analysisFull monitoring stack

DjangoReactCeleryPostgreSQLDockerPrometheus/Grafana

Source Code

Other Noteworthy Projects

Personal Portfolio

This site! Built with Next.js 16, Matter.js physics engine, AI chatbot, and Tailwind CSS.

Next.jsTypeScriptMatter.jsAI SDK

Open Source Contributions

7+ merged PRs to LangChain (100k★), Haystack (18k★), CAMEL (10k★). Streaming fixes, memory leaks, API design.

PythonTypeScriptpytest

Experience

Contributing Developer @ Open Source

Feb 2026

Merged PRs into LangChain (100k★), Haystack (18k★), CAMEL (10k★) production codebases
Fixed streaming bugs, memory leaks, and data loss issues in major LLM frameworks
Designed API features including a flexible document comparison field for RAG evaluators
Wrote 7 in-depth technical blog posts analyzing each contribution's root cause and fix
Iterated on maintainer feedback to ensure backward compatibility and comprehensive test coverage

PythonTypeScriptpytestGitGitHub

Technical Skills

Languages

PythonTypeScriptJavaScriptSQL

Frontend

ReactNext.jsTailwind CSS

Backend

FastAPIDjangoCeleryNode.js

AI / ML

LangChainHaystackLangGraphChromaDBRAG

Infrastructure

DockerPostgreSQLRedisGitLinux

Blog

March 2026/20 min

Building a Zendesk Training Gym for LLM Agents: A Curriculum Engineer's Field Notes

How I built a mock Zendesk environment with state machine verification, Boltzmann adaptive curriculum, and WebRL self-evolving tasks — a functional RLVR training gym built in two days for a Curriculum Engineer interview.

RLVRCurriculum LearningLLM AgentsFastAPIReinforcement Learning

Feb 2026/18 min

Building an Agentic RAG System from Scratch: Architecture, Hybrid Search, and Adaptive Query Routing

A deep dive into RAG Talk — a conversational AI system with hybrid BM25+vector search, Reciprocal Rank Fusion, cross-encoder reranking, and an intelligent agentic routing layer that adapts retrieval strategy to query intent.

RAGFastAPIChromaDBLLMArchitecture

Feb 2026/25 min

Building a Production-Grade IT Ticketing System: Django, React, Celery, and AI-Powered Ticket Analysis

A comprehensive technical walkthrough of designing and building an enterprise IT support platform with async task processing, 4-language i18n, RAG-powered ticket analysis, and a full observability stack with Prometheus and Grafana.

DjangoReactCeleryDockerFull-Stack

Feb 2026/12 min

How I Fixed a Streaming Bug That Corrupted Parallel Tool Calls in LangChain

A deep dive into LangChain's merge_lists() function, why parallel tool calls from Bedrock/Anthropic got corrupted, and designing a backward-compatible fix at the framework level.

LangChainStreamingPython

Feb 2026/10 min

Hunting Down a Memory Leak in LangChain's lru_cache

How Python's descriptor protocol creates new method objects on every access, causing @lru_cache to have 0% hit rate but 100% memory retention. The fix: 15 lines.

PythonMemoryDebugging

Feb 2026/8 min

Adding Structured Reasoning to Haystack's Streaming Pipeline

Migrating Anthropic and Ollama integrations from unstructured meta dict access to proper ReasoningContent objects, with helper extraction and accumulation refactoring.

HaystackStreamingAnthropic

Feb 2026/8 min

Designing a Flexible Document Comparison API for Haystack Evaluators

Adding a document_comparison_field parameter to MRR, MAP, and Recall evaluators — enabling comparison by ID or metadata instead of hardcoded content matching.

HaystackRAGAPI Design

Feb 2026/8 min

Extracting Hyperlinks from PPTX and XLSX Files in Haystack

How pandas.read_excel() silently discards hyperlinks, and the openpyxl workaround to preserve them. Plus python-pptx run-level hyperlink extraction.

HaystackPythonRAG

Feb 2026/8 min

Fixing Lost Rich Output in CAMEL's E2B Code Interpreter

An early-return chain silently dropped matplotlib plots and images. The fix: replace it with an accumulation pattern that collects output from all sources.

CAMELPythonDebugging

Feb 2026/5 min

Improving Error Messages for Non-OpenAI Embedding Providers in LangChain

When OpenAIEmbeddings is used with third-party providers, cryptic errors replace helpful guidance. The fix: better error messages and documentation.

LangChainDXError Handling

Writing

On style, imitation, and what machines know about literature.

Feb 2026/8 min read

The Pasticheur’s Machine

What nineteenth-century literary theory knows about large language models, and what LLMs reveal about style, originality, and imitation.

Literary TheoryAINLP

Feb 2026/35 min readPDF

Pastiche: After the Manner of Vladimir Jankélévitch

A long-form essay that performs what it describes: an investigation of stylistic imitation written as a pastiche of Jankélévitch’s philosophical prose.

Comparative LiteraturePhilosophy

Get In Touch

I'm actively looking for Developer Relations and AI engineering roles. Whether you have a role that fits, a project idea, or just want to connect — my inbox is always open.

Say Hello