ICML Expo Workshop Reliable and Efficient LLM Outputs with Mellea + Granite OSS Libraries

Expo Workshop

Reliable and Efficient LLM Outputs with Mellea + Granite OSS Libraries

Jake LoRocco ⋅ Kenney Ng ⋅ Paul Schweigert ⋅ Heiko Ludwig ⋅ Luis Lasras

AUDITORIUM

[ Abstract ] [ Project Page ]

Mon 6 Jul midnight PDT — 3 a.m. PDT

Abstract:

Every LLM application eventually runs into the same wall: the model generates plausible-sounding output that is wrong, off-format, or unsafe — and there is nothing between generation and delivery to catch it. Prompting the model harder helps sometimes. However, it is not reliable.

This workshop teaches a systematic approach to the problem using two open-source IBM tools: Mellea, a Python library for structured LLM generation, and Granite Libraries, a collection of lightweight LoRA adapters that score generated output against developer-defined requirements. Together they implement an Instruct-Validate-Repair loop — generate a response, measure it against your requirements, and select or retry before it reaches the user.

No cloud accounts, no audio hardware, no frontend build. A working environment takes under five minutes to set up.

What you will build: generative applications that grow from bare LLM calls to programs with validation and control flow.

What you will leave with: a mental model of how to enforce output quality programmatically, hands-on experience writing and tuning natural-language requirements, and a local codebase you can adapt to your own domain.

Technologies covered: Mellea, Granite Libraries (activated LoRA adapters), IBM Granite 4.0, Python, OpenAI-compatible inference backends (LM Studio, Ollama, vLLM).

All tools and models used are Apache 2.0 licensed and available on HuggingFace.

Live content is unavailable. Log in and register to view live content