OCI Generative AI Tops Spider 2.0 Lite, Strengthening Its NL2SQL Leadership for Enterprise Data

April 23, 2026 4 minute read

Marianne Liu

Principal Applied Scientist

Chuan Lei

Senior Principal Applied Scientist

Jason Creighton

Senior Principal Product Manager

Ankan Bansal

Director Applied Science

Tao Sheng

Senior Director of Applied Science

Sujith Ravi

Vice President, GenAI

Dan Roth

Chief AI Scientist

Introduction

Generative AI is transforming how users interact with enterprise data. Instead of writing SQL, users can express intent in natural language and expect context-relevant results. This paradigm, Natural Language to SQL (NL2SQL), is key to scalable data access.

However, real-world NL2SQL remains challenging. Beyond scale and complexity, ambiguity is a central issue in enterprise data. The same query can admit multiple valid interpretations depending on business definitions, schema design, and data semantics, such as aggregation rules, filtering scope, or metric definitions.

These ambiguities often lead to SQL that is syntactically correct but semantically incorrect, a failure mode that is not addressed by standard generation or planning approaches.
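To make this failure mode concrete, here is a small illustrative sketch. The `orders` table, its rows, and the question are hypothetical and are not taken from any benchmark. The question "What is the average order value?" can be read as an average over all orders or over completed orders only. Both queries run without error, yet they return different numbers.

```python
import sqlite3

# Hypothetical toy table; names and values are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL, status TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [
        (1, 100.0, "completed"),
        (2, 300.0, "completed"),
        (3, 200.0, "cancelled"),
        (4, 0.0, "cancelled"),
    ],
)

# Interpretation A: average over every order row.
avg_all = conn.execute("SELECT AVG(amount) FROM orders").fetchone()[0]

# Interpretation B: average over completed orders only.
avg_completed = conn.execute(
    "SELECT AVG(amount) FROM orders WHERE status = 'completed'"
).fetchone()[0]

print(avg_all, avg_completed)  # 150.0 200.0
```

Neither query is wrong as SQL. Which one is correct depends on a business definition that the question alone does not state.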

Oracle recently ranked #1 in the 2025 Archer NL2SQL Evaluation Challenge with its Reasoning-based Natural Language to SQL Solution, demonstrating the value of structured planning and reasoning.

Building on this foundation, we developed an enhanced solution focused on robustness and generalization in real-world settings. It ranks #1 on the Spider 2.0 Lite leaderboard.

What is Spider 2.0 Lite?

Spider 2.0 is a benchmark that evaluates NL2SQL systems on realistic enterprise workflows derived from production data applications.

Spider 2.0 Lite contains 547 examples across multiple SQL dialects (BigQuery, Snowflake, and SQLite). It retains key characteristics of enterprise NL2SQL:

Large, complex schemas (often 1,000+ columns)
Long-context reasoning over enterprise data
Multi-step SQL generation, sometimes exceeding 100 lines
Frequent ambiguity in user intent (e.g., aggregation rules, filtering scope, metric definitions)
Strong emphasis on execution accuracy and cross-dialect generalization

Result: #1 on Spider 2.0 Lite

Figure 1. Spider 2.0 Lite leaderboard results: Oracle’s SOMA-SQL ranks #1, demonstrating strong performance on complex enterprise queries.

As shown in Figure 1, Oracle’s SOMA-SQL ranks #1 on the Spider 2.0 Lite leaderboard, achieving:

72.02% Execution Accuracy (EX@1)

This result extends Oracle’s leadership in NL2SQL following our #1 ranking on the Archer Challenge, and demonstrates strong robustness on realistic enterprise workloads across multiple SQL dialects.
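For readers unfamiliar with the metric, execution accuracy scores a prediction by running it and comparing its result with the result of a reference query, rather than comparing SQL text. The sketch below is a simplified illustration using SQLite and an order-insensitive row comparison. The benchmark's official evaluation scripts define the exact comparison rules, which are not reproduced here, and the function names and toy table are assumptions made for this example.

```python
import sqlite3
from collections import Counter

def same_result(conn, predicted_sql, gold_sql):
    """Return True when both queries yield the same multiset of rows."""
    try:
        predicted = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        return False  # A prediction that fails to execute counts as wrong.
    gold = conn.execute(gold_sql).fetchall()
    return Counter(predicted) == Counter(gold)

def execution_accuracy(conn, pairs):
    """Fraction of (predicted, gold) pairs whose results match."""
    hits = sum(same_result(conn, p, g) for p, g in pairs)
    return hits / len(pairs)

# Hypothetical toy data for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE t (x INTEGER)")
conn.executemany("INSERT INTO t VALUES (?)", [(1,), (2,), (3,)])

pairs = [
    ("SELECT x FROM t WHERE x > 1", "SELECT x FROM t WHERE x >= 2"),  # same rows
    ("SELECT COUNT(*) FROM t", "SELECT SUM(x) FROM t"),               # 3 vs 6
    ("SELECT y FROM t", "SELECT x FROM t"),                           # unknown column
]
score = execution_accuracy(conn, pairs)  # one match out of three
```

The first pair shows why the metric is execution-based: the two queries differ textually but return the same rows, so the prediction counts as correct.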

Oracle’s Solution: SOMA-SQL

SOMA-SQL (Synthetic Query Logs and Probing for Multi-source Ambiguity Resolution) is a generalizable NL2SQL agent that resolves ambiguity across user queries, database schemas, and execution signals. As seen in Figure 2 below, instead of a single-pass mapping, it combines synthetic query logs, ambiguity-aware probing, schema enrichment, structured planning, and iterative SQL refinement to produce robust SQL across complex, multi-dialect enterprise environments.

We next describe the key components of SOMA-SQL.

1. Synthetic Query Logs (Ambiguity Grounding)

We construct an ambiguity-aware query log offline by generating (natural language question, corresponding SQL query) pairs aligned with the database schema, designed to capture and simulate common sources of ambiguity. At runtime, given a new natural language question, we retrieve the most relevant examples from this log and include them as few-shot context to improve the model’s ability to interpret and resolve ambiguities.

Construct ambiguity-aware query logs by transforming SQL variations into clarification questions (e.g., MCQs) with answers and explanations
Enrich the query context offline using these structured ambiguity cases
At inference time, retrieve relevant cases to guide interpretation and disambiguation
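The retrieval step described above can be sketched as follows. This is a minimal illustration, not the SOMA-SQL implementation: the log entries, the `clarification` field layout, and the use of token-overlap (Jaccard) similarity are all assumptions chosen to keep the example self-contained. A production system would more likely use a learned or embedding-based retriever.

```python
import re

def tokens(text):
    """Lowercase alphanumeric word tokens of a question."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))

def jaccard(a, b):
    """Token-set overlap in [0, 1]; a simple stand-in for a real retriever."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def retrieve(question, query_log, k=2):
    """Return the k log entries most similar to the question."""
    q = tokens(question)
    return sorted(
        query_log,
        key=lambda entry: jaccard(q, tokens(entry["question"])),
        reverse=True,
    )[:k]

# Hypothetical offline log: question, SQL, and a resolved clarification.
query_log = [
    {"question": "What is the total revenue per region?",
     "sql": "SELECT region, SUM(amount) FROM sales GROUP BY region",
     "clarification": "Revenue means gross amount, refunds not subtracted."},
    {"question": "How many active customers signed up last year?",
     "sql": "SELECT COUNT(*) FROM customers WHERE active = 1 AND signup_year = 2024",
     "clarification": "Active means the active flag is set, not recent purchases."},
    {"question": "What is the average revenue per customer?",
     "sql": "SELECT SUM(amount) / COUNT(DISTINCT customer_id) FROM sales",
     "clarification": "Average is per distinct customer, not per order."},
]

top = retrieve("total revenue per region last quarter", query_log)
few_shot = "\n\n".join(
    f"Q: {e['question']}\nClarification: {e['clarification']}\nSQL: {e['sql']}"
    for e in top
)
```

The resulting `few_shot` string would be prepended to the model prompt, so that each retrieved example carries its resolved interpretation as well as its SQL.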

2. Schema Enrichment and Linking

We enrich the database schema offline, before any natural language query is presented, by incorporating semantic information, query-oriented documentation, and contextual signals to bridge the gap between how users express intent in natural language and how data is structured in the schema.

Augment schema with query-specific documentation and contextual signals
Improve alignment between user intent and database structure
Support large, complex enterprise schemas
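As a rough illustration of what offline schema enrichment can look like, the sketch below attaches a description and sample values to raw column definitions and renders them as annotated text for a prompt. The `Column` type, the table, and the rendering format are hypothetical; the source does not specify how enriched schemas are stored or presented.

```python
from dataclasses import dataclass, field

@dataclass
class Column:
    name: str
    sql_type: str
    description: str = ""  # Added offline; empty when not enriched.
    sample_values: list = field(default_factory=list)

def render_table(table_name, columns):
    """Render a table as annotated text that a model can read."""
    lines = [f"TABLE {table_name}"]
    for col in columns:
        line = f"  {col.name} {col.sql_type}"
        if col.description:
            line += f"  -- {col.description}"
        if col.sample_values:
            line += f" (e.g. {', '.join(map(str, col.sample_values))})"
        lines.append(line)
    return "\n".join(lines)

# Hypothetical table with terse column names, typical of enterprise schemas.
columns = [
    Column("cust_id", "INTEGER", "Unique customer identifier"),
    Column("st", "TEXT", "Order status code", ["C", "X"]),
    Column("amt", "REAL"),  # Not yet enriched.
]
rendered = render_table("ord", columns)
print(rendered)
```

With the description in place, a question about "cancelled orders" has a chance of being linked to the `st` column, which the bare name alone would not suggest.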

3. Structured Planning and SQL Generation

We introduce an explicit planning phase that decomposes the user query into intermediate reasoning steps and systematically explores multiple interpretations, enabling more robust and accurate SQL generation.

Decompose the user query into intermediate reasoning steps
Generate multiple candidate SQL queries to capture alternative interpretations
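One way to picture systematic exploration of interpretations is as a cross product over ambiguity dimensions. The sketch below is a deterministic stand-in for illustration only: the dimensions, labels, and SQL template are hypothetical, and the source does not state how SOMA-SQL produces its candidates (in practice a language model would likely generate them).

```python
from itertools import product

# Hypothetical ambiguity dimensions for "average order value by region".
dimensions = {
    "aggregation": {
        "per_order": "AVG(amount)",
        "per_customer": "SUM(amount) / COUNT(DISTINCT customer_id)",
    },
    "filter": {
        "all_orders": "1 = 1",
        "completed_only": "status = 'completed'",
    },
}

def enumerate_candidates(dimensions):
    """Build one candidate SQL per combination of interpretation choices."""
    names = list(dimensions)
    candidates = []
    for choice in product(*(dimensions[n].items() for n in names)):
        labels = {n: label for n, (label, _) in zip(names, choice)}
        fragments = {n: frag for n, (_, frag) in zip(names, choice)}
        sql = (
            f"SELECT region, {fragments['aggregation']} AS value "
            f"FROM orders WHERE {fragments['filter']} GROUP BY region"
        )
        candidates.append({"interpretation": labels, "sql": sql})
    return candidates

candidates = enumerate_candidates(dimensions)
for c in candidates:
    print(c["interpretation"], "->", c["sql"])
```

Keeping the interpretation labels next to each SQL string makes it possible for later stages to explain which reading of the question a candidate encodes.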

4. Critique and Refinement

We add an explicit critique phase that detects semantic issues in the generated SQL by integrating signals from both the database and the LLM, summarizes them in a structured report, and then uses that report to drive query correction before execution-based validation.

Analyze generated SQL for logical and semantic issues
Provide structured feedback to guide correction
Improve candidate quality before execution-based validation
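A structured critique report might be sketched as below. This is an assumption-laden illustration: the `CritiqueReport` type and `critique` function are hypothetical, the database signal comes from asking SQLite to compile the statement with `EXPLAIN`, and the model-side findings are passed in as plain strings because no LLM is called here.

```python
import sqlite3
from dataclasses import dataclass, field

@dataclass
class CritiqueReport:
    sql: str
    database_issues: list = field(default_factory=list)
    model_issues: list = field(default_factory=list)

    @property
    def needs_fix(self):
        return bool(self.database_issues or self.model_issues)

def critique(conn, sql, model_issues=()):
    """Combine a database-side check with model-side findings."""
    report = CritiqueReport(sql=sql, model_issues=list(model_issues))
    try:
        # EXPLAIN makes SQLite compile the statement, surfacing errors such
        # as unknown columns, without running the query itself.
        conn.execute(f"EXPLAIN {sql}")
    except sqlite3.Error as exc:
        report.database_issues.append(str(exc))
    return report

# Hypothetical schema for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (amount REAL, status TEXT)")

bad = critique(conn, "SELECT amout FROM orders")  # misspelled column
ok = critique(conn, "SELECT amount FROM orders")
flagged = critique(
    conn,
    "SELECT AVG(amount) FROM orders",
    model_issues=["Question asks about completed orders; no status filter."],
)
```

The third case is the one the critique phase targets: the SQL compiles cleanly, so only the model-side finding reveals the semantic problem.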

5. Ambiguity-Driven Probing (Execution-Grounded Resolution)

We introduce an ambiguity probing phase that executes targeted checks against the database to distinguish competing SQL interpretations and resolve ambiguity using data-backed evidence.

Identify implementation differences across candidate SQLs and map them to ambiguity dimensions (intent, schema, value)
Generate targeted probing SQL queries to validate competing interpretations against the data
Convert probe outcomes into explicit resolution decisions
Apply data-validated fixes and select the final SQL
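The probing steps above can be illustrated with a small value-ambiguity example. Everything here is hypothetical (the `customers` table, the two candidates, and the decision rule); it only shows the general pattern of issuing a cheap probe query and letting the data decide between competing interpretations.

```python
import sqlite3

# Hypothetical data: the country is stored as a short code.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, country TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "US"), (2, "US"), (3, "DE")],
)

# Two candidates that differ only in the filter literal (value ambiguity).
candidates = {
    "US": "SELECT COUNT(*) FROM customers WHERE country = 'US'",
    "United States": (
        "SELECT COUNT(*) FROM customers WHERE country = 'United States'"
    ),
}

# Probe: which literals actually occur in the column?
probe_sql = "SELECT DISTINCT country FROM customers"
stored = {row[0] for row in conn.execute(probe_sql)}

# Resolution: keep a candidate only if its literal is backed by the data.
supported = [literal for literal in candidates if literal in stored]
final_sql = candidates[supported[0]] if len(supported) == 1 else None

print(supported, final_sql)
```

Without the probe, both candidates would execute successfully, and the second would silently return zero. The probe turns that silent failure into an explicit, data-backed decision.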

Conclusion

SOMA-SQL advances NL2SQL by directly addressing ambiguity in enterprise data. Achieving #1 on Spider 2.0 Lite demonstrates strong robustness and generalization across real-world workloads. This work will be integrated into the Oracle NL2SQL product and will enable customers to use natural language as a reliable interface for large-scale enterprise data systems.

Marianne Liu

Principal Applied Scientist

Marianne Liu is a Principal Applied Scientist and Tech Lead on the Generative AI Science team at Oracle Cloud Infrastructure (OCI), where she builds enterprise-grade AI systems for natural language interfaces and data platforms. Her work focuses on NL2SQL, agentic reasoning, and LLM evaluation, with an emphasis on reliability, ambiguity resolution, and real-world deployment. She has led the development of enterprise NL2SQL and agentic data systems at Oracle, contributing to state-of-the-art results on benchmarks such as Spider 2.0 Lite and Archer NL2SQL. Prior to Oracle, she was a Senior Data Scientist at IBM. She holds a Master’s degree from Carnegie Mellon University in Information Systems Management and a Bachelor’s degree (Honors) from Fudan University. She has published at venues such as ACL and serves as a reviewer for conferences including NeurIPS.

Sai Ashish Somayajula

Senior Applied Scientist

Sai Ashish Somayajula is a Senior Applied Scientist in the Generative AI team at Oracle Cloud Infrastructure (OCI), where he develops enterprise-scale AI systems focused on NL2SQL, analytics, and reasoning over structured and unstructured data. He earned his Ph.D. in Electrical and Computer Engineering from the University of California, San Diego, where he was a recipient of the Jacobs School of Engineering Departmental Fellowship. His research has been published in leading venues including NeurIPS, ICML, ACL, TACL, NAACL, TMLR, and Scientific Reports (Nature Portfolio). Ashish contributes actively to the research community as a reviewer and area chair for major conferences such as ACL, NAACL, EMNLP, ICML, ICLR, and NeurIPS, and has served as Student Research Workshop (SRW) Chair for EACL 2026. He holds a Master’s degree in Intelligent Systems and Robotics and a Bachelor’s degree in Electrical Engineering with a minor in Computer Science from IIT Hyderabad, where he received the Academic Excellence Award twice.

Chuan Lei

Senior Principal Applied Scientist

Chuan Lei is a Senior Principal Applied Scientist in Generative AI at Oracle Cloud Infrastructure (OCI), responsible for tackling problems at the intersection of generative AI, agents, data analytics, and machine learning. Before that, he led research and development efforts for multiple products at AWS, including Q for Generative SQL, a text-to-SQL agent for Amazon Redshift; Structured Knowledge Bases in Amazon Bedrock; Q data integration in AWS Glue; and Amazon DataZone. Prior to joining AWS, he was a machine learning engineer at Instacart, a research staff member at IBM Research - Almaden, and a researcher at NEC Laboratories America. Chuan received his Ph.D. in Computer Science from Worcester Polytechnic Institute.

Jason Creighton

Senior Principal Product Manager

Ankan Bansal

Director Applied Science

Ankan Bansal is a Director of Applied Science at Oracle Cloud Infrastructure (OCI), leading science efforts in code generation and understanding, and agents for support. Before joining Oracle, Ankan was an Applied Science Manager at AWS, where he led science teams for products including Amazon Quick Suite, Bedrock Data Automation, and Amazon Textract. Ankan received his Ph.D. in Electrical and Computer Engineering from the University of Maryland, College Park, and his Bachelor's and Master's degrees from IIT Kanpur.

Tao Sheng

Senior Director of Applied Science

Tao Sheng is a Senior Director of Applied Science at Oracle Cloud Infrastructure (OCI). He is a seasoned AI researcher and technologist with over 16 years of industry experience since earning his PhD in 2010, with expertise spanning generative AI and LLMs, multimodal models, RAG, NL2SQL, and AI agents. He has held impactful roles at leading technology companies, including Oracle, Amazon, Qualcomm, and Intel, contributing to the development and launch of large-scale AI products used by millions of customers worldwide. Tao holds over 60 U.S. and international patents and has authored more than 20 peer-reviewed research papers. He also serves as a reviewer for top-tier AI conferences, actively contributing to the advancement of the field.

Syed Fahad Allam Shah

Senior Director

Syed Fahad Allam Shah is the Senior Director of Applied Science at Oracle, leading efforts in Responsible AI, verticalization, and code/SQL generation. He has worked in the AI space for over 21 years, building and delivering AI systems in the cloud, on device, and as integrated offerings. He holds a PhD and has served as a reviewer, program committee member, and chair at multiple conferences (NeurIPS, ICML, ICLR, AISTATS, KDD, etc.).

Sujith Ravi

Vice President, GenAI

Dr. Sujith Ravi, Vice President of GenAI at Oracle Cloud Infrastructure (OCI), leads the development of transformative AI technologies, including Generative AI, Agents, and Multimodal AI, to revolutionize enterprise capabilities at cloud scale. With over 20 years of AI experience, he invented industry-defining innovations like Smart Reply (one of the first generative AI systems used by billions of people), TensorFlow Lite, and the first trillion-scale Graph ML platform. Previously, he founded SliceX AI and held strategic leadership roles at Amazon Alexa AI and Google AI, driving large-scale AI and machine learning advancements. A prolific author with over 100 publications and patents, Dr. Ravi is widely recognized for his contributions to AI research and innovation, serving as a board member for UCSC and, previously, as chair for AI at the National Academy of Engineering.

Dan Roth

Chief AI Scientist

Dan Roth is the Chief AI Scientist at Oracle and the Eduardo D. Glandt Distinguished Professor at the University of Pennsylvania. Previously, Dan was a VP/Distinguished Scientist at AWS AI, where he led the scientific effort behind Amazon’s first-generation GenAI products, including Titan Models, Amazon Q, and Amazon Bedrock. Dan is a Fellow of the AAAS, ACM, AAAI, and ACL, and a recipient of the IJCAI John McCarthy Award “for major conceptual and theoretical advances in the modeling of natural language understanding, machine learning, and reasoning.” He has published broadly in natural language processing, machine learning, knowledge representation and reasoning, and learning theory, was the Editor-in-Chief of the Journal of Artificial Intelligence Research (JAIR), and has served as a Program Chair and Conference Chair for the major conferences in his research areas. Dan has been involved in several ML/NLP/GenAI startups in domains that range from legal and compliance to health care. Dan received his B.A. summa cum laude in Mathematics from the Technion, Israel, and his Ph.D. in Computer Science from Harvard University in 1995.
