AI in Business, Data Science, Oracle AI, Technical Solutions, Use Cases

Bringing Cross-Framework Observability to Oracle Open Agent Specification with Opik

June 30, 2026 3 minute read

Cesare Bernardis

Principal Member of Technical Staff

Damien Hilloulin

Senior Manager

Rhicheek Patra

Senior Director, Oracle AI Database

Sungpack Hong

Vice President, AI Research, Database

AI agents are becoming core components of enterprise applications, automating workflows, coordinating tools, and orchestrating complex tasks. However, a key challenge persists: fragmentation. Developers often build agents around a specific framework, then attach observability, evaluation, testing, and deployment workflows to that same framework.

This can make it difficult to compare runtimes, migrate agents, or standardize development practices across teams. A framework selected early in development can become a long-term architectural constraint, even as agentic frameworks continue to evolve.

To address this challenge, Opik is integrating with Oracle Open Agent Specification—Agent Spec in brief. Together, Agent Spec and Opik enable developers to define agents once, run them across compatible frameworks, and observe and evaluate their behavior through a consistent workflow.

Portable agent definitions with Agent Spec

Agent Spec is an open source, framework-agnostic configuration language for defining AI agents and workflows. It captures the core components of an agent—LLM settings, prompts, tools, and flow structure—in a portable representation that can be executed across compatible runtimes such as LangGraph, AutoGen, and WayFlow.

With Agent Spec, developers can separate the agent definition from the execution framework. This enables teams to preserve prompts, tool schemas, and orchestration logic while testing or migrating across different runtimes.

For enterprises, this portability helps reduce dependency on a single framework and supports shared development patterns across teams, projects, and deployment environments.

Opik for observability and evaluation

Opik provides tracing, debugging, and evaluation for LLM applications and agents. With the new Agent Spec integration, Opik can capture Agent Spec-defined executions consistently across supported runtimes.

This enables developers to inspect agent behavior across frameworks, including:

LLM calls, tool calls, and intermediate steps
Inputs, outputs, metadata, and final responses
Runtime-specific differences in behavior, latency, and cost
Evaluation results across the same agent definition

Because traces follow a consistent structure, teams can compare frameworks and agent changes without rebuilding observability or evaluation workflows for each runtime.

What this integration enables

The Opik and Agent Spec integration provides a modular foundation for agent development:

Define once, run across frameworks: Share and reuse Agent Spec configurations across compatible runtimes.
Observe consistently: Trace LLM calls, tool executions, intermediate steps, and outputs in Opik across frameworks.
Evaluate repeatedly: Run the same Opik evaluations and test suites across runtimes without framework-specific evaluation code.
Compare implementations: Benchmark latency, cost, and output quality when changing frameworks, prompts, tools, or LLMs.
Support enterprise workflows: Build shared CI/CD, testing, and governance processes across agents developed by different teams.

This separation of concerns allows the agent definition, runtime, and observability layer to evolve independently.

Evaluation and benchmarking

Once Agent Spec executions are captured in Opik, teams can apply repeatable evaluations across runtimes. Typical workflows include deterministic checks for output structure, required fields, and tool usage, as well as LLM-as-judge evaluations for qualities such as correctness, helpfulness, completeness, and relevance.

The same evaluation setup can be used to validate prompt changes, model swaps, tool updates, or runtime migrations. This helps teams identify regressions earlier and compare agent behavior using a common evaluation harness.

Getting started

You can start using Agent Spec agents with Opik in three steps:

Define your agent using Oracle’s PyAgentSpec SDK.
Load it onto a runtime using the appropriate adapter for your framework of choice.
Wrap execution with Opik’s AgentSpecInstrumentor to capture traces in Opik.

Cesare Bernardis

Principal Member of Technical Staff

Cesare Bernardis is a Principal Member of Technical Staff in the Oracle Database AI Research team. He works on agentic AI and large language models, and he is a core contributor to Open Agent Specification and WayFlow.

Damien Hilloulin

Senior Manager

Damien Hilloulin leads several AI/ML projects at Oracle. He is interested in the domains of machine learning models and systems, recommender systems and Agentic AI. He is currently focusing on Open Agent Specification, to make Agentic AI more reliable, interoperable and accessible.

Rhicheek Patra

Senior Director, Oracle AI Database

Rhicheek Patra leads multiple AI/ML projects at Oracle and joined Oracle after completing his Ph.D. at EPFL. He is interested in the domain of machine learning (including graph learning), privacy & security, and recommender systems. His research work has led to publications in many premier conferences like NAACL, ICML, VLDB, ICDE and others. While he likes to continue his research in the above-mentioned topics, he also likes to put them into practice.

Sungpack Hong

Vice President, AI Research, Database

Sungpack Hong is a Vice President of AI research in Oracle Database

Building a fault-tolerant training system with PyTorch Monarch on OKE

Dmitriy Ochakovskiy | 2 minute read

Building a Model Distillation Pipeline on OCI Data Science for Fraud Detection (Part-2)

Vijay Kumar | 2 minute read

Bringing Cross-Framework Observability to Oracle Open Agent Specification with Opik

Portable agent definitions with Agent Spec

Opik for observability and evaluation

What this integration enables

Evaluation and benchmarking

Getting started

Authors

Cesare Bernardis

Principal Member of Technical Staff

Damien Hilloulin

Senior Manager

Rhicheek Patra

Senior Director, Oracle AI Database

Sungpack Hong

Vice President, AI Research, Database

Building a fault-tolerant training system with PyTorch Monarch on OKE

Building a Model Distillation Pipeline on OCI Data Science for Fraud Detection (Part-2)