Wednesday Mar 09, 2016

Learn Predictive Analytics in 2 Days - New Oracle University Course!

What you will learn:  This Predictive Analytics using Oracle Data Mining Ed 1 training will review the basic concepts of data mining. Expert Oracle University instructors will teach you how to leverage the predictive analytical power of Oracle Data Mining, a component of the Oracle Advanced Analytics option.

Learn To:

  • Explain basic data mining concepts and describe the benefits of predictive analysis.
  • Understand primary data mining tasks, and describe the key steps of a data mining process.
  • Use the Oracle Data Miner to build, evaluate, apply, and deploy multiple data mining models.
  • Use Oracle Data Mining's predictions and insights to address many kinds of business problems.
  • Deploy data mining models for end-user access, in batch or real-time, and within applications.

Benefits to You

When you've completed this course, you'll be able to use the Oracle Data Miner 4.1, the Oracle Data Mining "workflow" GUI, which enables data analysts to work directly with data inside the database. The Data Miner GUI provides intuitive tools that help you explore the data graphically, build and evaluate multiple data mining models, apply Oracle Data Mining models to new data, and deploy Oracle Data Mining's predictions and insights throughout the enterprise.

Oracle Data Miner's SQL APIs - Get Results in Real-Time

Oracle Data Miner's SQL APIs automatically mine Oracle data and deploy results in real-time. Because the data, models, and results remain in the Oracle Database, data movement is eliminated, security is maximized and information latency is minimized.


  • Course Objectives
  • Suggested Course Prerequisites
  • Suggested Course Schedule
  • Class Sample Schemas
  • Practice and Solutions Structure
  • Review location of additional resources

Predictive Analytics and Data Mining Concepts

  • What is the Predictive Analytics?
  • Introducting the Oracle Advanced Analytics (OAA) Option?
  • What is Data Mining?
  • Why use Data Mining?
  • Examples of Data Mining Applications
  • Supervised Versus Unsupervised Learning
  • Supported Data Mining Algorithms and Uses

Understanding the Data Mining Process

  • Common Tasks in the Data Mining Process
  • Introducing the SQL Developer interface

Introducing Oracle Data Miner 4.1

  • Data mining with Oracle Database
  • Setting up Oracle Data Miner
  • Accessing the Data Miner GUI
  • Identifying Data Miner interface components
  • Examining Data Miner Nodes
  • Previewing Data Miner Workflows

Using Classification Models

  • Reviewing Classification Models
  • Adding a Data Source to the Workflow
  • Using the Data Source Wizard
  • Using Explore and Graph Nodes
  • Using the Column Filter Node
  • Creating Classification Models
  • Building the Models
  • Examining Class Build Tabs

Using Regression Models

  • Reviewing Regression Models
  • Adding a Data Source to the Workflow
  • Using the Data Source Wizard
  • Performing Data Transformations
  • Creating Regression Models
  • Building the Models
  • Comparing the Models
  • Selecting a Model

Using Clustering Models

  • Describing Algorithms used for Clustering Models
  • Adding Data Sources to the Workflow
  • Exploring Data for Patterns
  • Defining and Building Clustering Models
  • Comparing Model Results
  • Selecting and Applying a Model
  • Defining Output Format
  • Examining Cluster Results

Performing Market Basket Analysis

  • What is Market Basket Analysis?
  • Reviewing Association Rules
  • Creating a New Workflow
  • Adding a Data Source to the Workflow
  • Creating an Association Rules Model
  • Defining Association Rules
  • Building the Model
  • Examining Test Results

Performing Anomaly Detection

  • Reviewing the Model and Algorithm used for Anomaly Detection
  • Adding Data Sources to the Workflow
  • Creating the Model
  • Building the Model
  • Examining Test Results
  • Applying the Model
  • Evaluating Results

Mining Structured and Unstructured Data

  • Dealing with Transactional Data
  • Handling Aggregated (Nested) Data
  • Joining and Filtering data
  • Enabling mining of Text
  • Examining Predictive Results

Using Predictive Queries

  • What are Predictive Queries?
  • Creating Predictive Queries
  • Examining Predictive Results

Deploying Predictive models

  • Requirements for deployment
  • Deployment Options
  • Examining Deployment Options

Monday Feb 29, 2016

Guest Lecture on Big Data & Analytics to U. Kansas Business School Students

Recently, I was asked by a friend and colleague, Chris Claterbos, Lecturer at University of Kansas' Business School, to deliver a guest lecture to his business analytics students.  

In preparation, so as to not make it an entirely Oracle "product" presentation, I tried to gather some general information on the big data + analytics market, job opportunities & careers and future musings about where the industry is headed.  I liked resulting presentation so am posting and sharing it here.  

U. Kansas Guest Lecture on Big Data Analytics with Oracle's Advanced Analytics, Big Data SQL and Cloud

  • Big Data + Analytics “Phenomenon”
  • Careers in Big Data Analytics
  • Product
    • Oracle Advanced Analytics Overview & Features/Benefits
    • Brief Demos
  • Example Customer References
  • Applications “Powered by OAA”
  • Getting Started
  • Q & A

Enjoy!  Hopefully all you all become data'n'science stars!  


Wednesday Feb 03, 2016

Links to Presentations: BIWA Summit'16 - Big Data + Analytics User Conference Jan 26-28, @ Oracle HQ Conference Center

We had a great event with ~425 attendees, in depth technical presentations delivered by experts and even had several 2 hour Hands on Labs training classes that used the Oracle Database Cloud!  Watch for more coverage of event in various Oracle marketing and partner content venues.

Many thanks to all the BIWA board of directors and many volunteers who have put in so much work to make this BIWA Summit the best BIWA user event ever.  Mark your calendars for BIWA Summit’17, January 31, Feb. 1 & Feb. 2, 2017.  We’ll be announcing Call for Abstracts in the future, so please direct your best customers and speakers to submit.  We’re aiming to continue to make BIWA + Spatial + YesSQL Summit the best focused user gathering for sharing best practices for novel and interesting use cases of Oracle technologies.

BIWA is an IOUG SIG run by entirely by customers, partners and Oracle employee volunteers.  We’re always looking for people who would like to be involved.  Let me know if you’d like to contribute to the planning and organization of future BIWA events and activities.

See everyone at BIWA’17!

Charlie, on behalf of the entire BIWA board of directors  (

(see for more information)

See List of BIWA Summit'16 Presentations below.  Click on Details to access the speaker’s abstract and download the files (assuming the speaker has posted them for sharing).

We now have a schedule at a glance to show you all the sessions in a tabular agenda.


See bottom of page for the Session Search capability

Below is a list of the sessions and links to download most of the materials for the various sessions.  Click on the DETAILS button next to the session you want to download, then the page should refresh with the session description and (assuming the presenter uploaded files, but be aware that files may be limited to 5MB) you should see a list of files for that session.  See the full list below:

Advanced Analytics

Presentations (Click on Details to access file if submitted by presenter)

Dogfooding – How Oracle Uses Oracle Advanced Analytics To Boost Sales Efficiency


Oracle Modern Manufacturing - Bridging IoT, Big Data Analytics and ERP for Better Results


Predictive Modelling and Forecasting using OER


Enabling Clorox as Data Driven Enterprise


Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold


Large Scale Machine Learning with Big Data SQL, Hadoop and Spark


Stubhub and Oracle Advanced Analytics


Fiserv Case Study: Using Oracle Advanced Analytics for Fraud Detection in Online Payments


Advanced Analytics for Call Center Operations


Machine Learning on Streaming Data via Integration of Oracle R Enterprise and Oracle Stream Explorer


Learn Predictive Analytics in 2 hours!! Oracle Data Miner 4.0 Hands on Lab


Scaling R to New Heights with Oracle Database


Predictive Analytics using SQL and PL/SQL


Big Data Analytics with Oracle Advanced Analytics 12c and Big Data SQL and the Cloud


Improving Predictive Model Development Time with R and Oracle Big Data Discovery


Oracle R Enterprise 1.5 - Hot new features!


Is Oracle SQL the best language for Statistics


BI and Visualization

Presentations (Click on Details to access file if submitted by presenter)

Electoral fraud location in Brazilian General Elections 2014


The State of BI


Case Study of Improving BI Apps and OBIEE Performance


Preparing for BI 12c Upgrade


Data Visualization at Sound Exchange – a Case Study


Integrating OBIEE and Essbase, Why it Makes Sense


The Dash that changed a culture


Optimize Oracle Business Intelligence Analytics with Oracle 12c In-Memory Database option


Oracle Data Visualization vs. Answers: The Cage Match


What's New With Oracle Business Intelligence 12c


Workforce Analytics Leveraging Oracle Business Intelligence Cloud Serivces (BICS)


Defining a Roadmap for Migrating to Oracle BI Applications on ODI


See What’s There and What’s Coming with BICS & Data Visualization


Free form Data Visualization, Mashup BI and Advanced Analytics with BI 12c


Oracle Data Visualization Cloud Service Hands-On Lab with Customer Use Cases


On Metadata, Mashups and the Future of Enterprise BI


OBIEE 12c and the Leap Forward in Lifecycle Management


Supercharge BI Delivery with Continuous Integration


Visual Analyzer and Best Practices for Data Discovery


BI Movie Magic: Maps, Graphs, and BI Dashboards at AMC Theatres


Oracle Business Intelligence (OBIEE) the Smart View Way


Big Data

Presentations (Click on Details to access file if submitted by presenter)

Oracle Big Data: Strategy and Roadmap


Oracle Modern Manufacturing - Bridging IoT, Big Data Analytics and ERP for Better Results


Leveraging Oracle Big Data Discovery to Master CERN’s Control Data


Enrich, Transform and Analyse Big Data using Big Data Discovery and Visual Analyzer


Oracle Big Data SQL: Unified SQL Analysis Across the Big Data Platform


High Speed Video Processing for Big Data Applications


Enterprise Data Hub with Oracle Exadata and Oracle Big Data Appliance


How to choose between Hadoop, NoSQL or Oracle Database


Analytical SQL in the Era of Big Data


Cloud Computing

Presentations (Click on Details to access file if submitted by presenter)

Oracle DBaaS Migration Road Map


Centralizing Spatial Data Management with Oracle Cloud Databases


End Users data in BI - Data Mashup and Data Blending with BICS , DVCS and BI 12c


Oracle BI Tools on the Cloud--On Premise vs. Hosted vs. Oracle Cloud


Hybrid Cloud Using Oracle DBaaS: How the Italian Workers Comp Authority Uses Graph Technology


Build Your Cloud with Oracle Engineered Systems


Safe Passage to the CLOUD – Analytics


Your Journey to the Cloud : From Dedicated Physical Infrastructure to Cloud Bursting


Data Warehousing and ETL

Presentations (Click on Details to access file if submitted by presenter)

Getting to grips with SQL Pattern Matching


Making SQL Great Again (SQL is Huuuuuuuuuuuuuuuge!)


Controlling Execution Plans (without Touching the Code)


Taking Full Advantage of the PL/SQL Result Cache


Taking Full Advantage of the PL/SQL Compiler


Advanced SQL: Working with JSON Data


Oracle Database In-Memory Option Boot Camp: Everything You Need to Know


Best Practices for Getting Started With Oracle Database In-Memory


Extreme Data Warehouse Performance with Oracle Exadata


Real-Time SQL Monitoring in Oracle Database 12c


A Walk Through the Kimball ETL Subsystems with Oracle Data Integration


MySQL 5.7 Performance: More Than 1.6M SQL Queries per Second


Implement storage tiering in Data warehouse with Oracle Automatic Data Optimization


Edition-Based Redefinition Case Study


12-Step SQL Tuning Method


Where's Waldo? Using a brute-force approach to find an Execution Plan the CBO hides


Delivering an Enterprise-Wide Standard Chart of Accounts at GE with Oracle DRM


Agile Data Engineering: Introduction to Data Vault Data Modeling


Worst Practice in Data Warehouse Design


Same SQL Plan, Different Performance


Why Use PL/SQL?


Transforming one table to another: SQL or PL/SQL?


Understanding the 10053 Trace


Analytic Views - Bringing Star Queries into the Twenty-First Century


The Place of SQL in the Hybrid World


The Next Generation of the Oracle Optimizer


Internet of Things

Presentations (Click on Details to access file if submitted by presenter)

Oracle Modern Manufacturing - Bridging IoT, Big Data Analytics and ERP for Better Results


Meet Your Digital Twin


Industrial IoT and Machine Learning - Making Wind Energy Cost Competitive


Fault Detection using Advanced Analytics at CERN's Large Hadron Collider: Too Hot or Too Cold


Big Data and the Internet of Things in 2016: Beyond the Hype


IoT for Big Machines


The State of Internet of Things (IoT)


Oracle Spatial Summit

Presentations (Click on Details to access file if submitted by presenter)

Build Your Own Maps with the Big Data Discovery Custom Visualization Component


Massively Parallel Calculation of Catchment Areas in Retail


Dismantling Criminal Networks with Graph and Spatial Visualization and Analysis


Best Practices for Developing Geospatial Apps for the Cloud


Map Visualization in Analytic Apps in the Cloud, On-Premise, and Mobile


Best Practices, Tips and Tricks with Oracle Spatial and Graph


Delivering Smarter Spatial Data Management within Ordnance Survey, UK


Deploying a Linked Data Service at the Italian National Institute of Statistics


ATLAS - Utilizing Oracle Spatial and Graph with Esri for Pipeline GIS and Linear Asset Management


Oracle Spatial 12c as an Applied Science for Solving Today's Real-World Engineering Problems


Assembling a Large Scale Map for the Netherlands Using Oracle 12c Spatial and Graph


Using Open Data Models to Rapidly Develop and Prototype a 3D National SDI in Bahrain


Implementation of LBS services with Oracle Spatial and Graph and MapViewer in Zain Jordan


Interactive map visualization of large datasets in analytic applications


Gain Insight into Your Graph Data -- A hands on lab for Oracle Big Data Spatial and Graph


Applying Spatial Analysis To Big Data


Big Data Spatial: Location Intelligence, Geo-enrichment and Spatial Analytics


What’s New with Spatial and Graph? Technologies to Better Understand Complex Relationships


Graph Databases: A Social Network Analysis Use Case


High Performance Raster Database Manipulation and Data Processing with Oracle Spatial and Graph


3D Data Management - From Point Cloud to City Model


The Power of Geospatial Visualization for Linear Assets Using Oracle Enterprise Asset Management


Oracle Spatial and Graph: New Features for 12.2


Fast, High Volume, Dynamic Vehicle Routing Framework for E-Commerce and Fleet Management


Managing National Broadband Infrastructure at Turk Telekom with Oracle Spatial and Graph



Presentations (Click on Details to access file if submitted by presenter)

Taking Full Advantage of the PL/SQL Compiler


Taking Full Advantage of the PL/SQL Result Cache


Meet Your Digital Twin


Making SQL Great Again (SQL is Huuuuuuuuuuuuuuuge!)


Lightning Round for Vendors


Wednesday Dec 16, 2015

BIWA's Got Talent YouTube Demo Contest! - Enter and Win $500!!!

Best Oracle "Tech Stack" YouTube Demo Contest!

BIWA Wants YOU (Customers, Partners, Oracle Employees, whatever--everyone!) to post on YouTube one or multiple YouTube videos that highlight BIWA focused Oracle technologies/products/features or anything BIWA related!   See #BIWASGOTTALENT

Contest Details

Two categories

  • Customers, Partners, Students, Friends of BIWA--Anyone!
  • Oracle Employees--Note:  Any concerns about eligibility for Oracle employees is the responsibility of the employee

Judges will award points per the following scheme--MAX 100 points

  • Maximum of 40 points: Perception of usefulness and value added to the BIWA community, user or company
  • Maximum 25 points (5 points each): Each Oracle product or major feature highlighted e.g. 5 points for OAA, 5 points for Spatial, 5 points for OBIEE, BDA, BDD, etc.
  • Maximum of 10 points:  Completeness and clarity of associated documentation, reusable code, etc.
  • Maximum of 15 points:  Intangibles e.g. cleverness, sizzle, coolness, etc.--whatever excites and moves the judges
  • Maximum of 10 points: Most "likes" on YouTube

Each YouTube recorded "live" entry must include:

  • BIWAS GOT TALENT with BIWA Summit 2016 Logo (above on this page)
  • Title of your YouTube Video
  • Author(s), titles and contact information
  • Include #BIWASGOTTALENT in the meta information on YouTube
  • When submitting on YouTube and send an email to with a link
  • Presentation must be not to exceed more than 10 minutes of YouTube video. Submissions longer than 10 minute will be severely penalized by the judges. :(

The top two presentations in each category will be shown at BIWA Summit 2016 in Redwood Shores, California, January 26-28, 2016

Winners will be chosen based on a combination of the number of points received from the judges.  Submitters are encouraged to promote their #BIWASGOTTALENT video to accumulate "likes".  Prize can be taken as cash or donation to charity.

Rules, Regulations and Other Details:

  • By submitting your entry, you agree that BIWA may use your submission for marketing or other purposes
  • The winner will be notified by email by January 28th, 2016 and does not have to be present at BIWA Summit 2016 to win

For questions, please email

Monday Oct 12, 2015

NHS Business Services Authority Gains Better Insight into Data, Identifies circa GBP100 Million (US$156 Million) in Potential Savings in Just Three Months

NHS Business Services Authority Gains Better Insight into Data, Identifies circa GBP100 Million (US$156 Million) in Potential Savings in Just Three Months

The NHS Business Services Authority (NHSBSA) is a special health authority and an arm’s length body of the Department of Health for England. It provides a range of critical central services to NHS organizations, contractors, patients, and the public. Services include managing the NHS Pension schemes in England and Wales, managing payments to primary care dental and pharmacy contractors, and administering the European Health Insurance Card (EHIC).

The NHS budget for 2015/16 is approximately GBP116 billion (US$179 billion) and the total funds administered by the NHSBSA (including those for the NHS Pension schemes) amount to circa GBP32 billion (US$48 billion). The Department of Health asked the NHSBSA to take a proactive role to identify opportunities to reduce costs and eliminate waste. One way to do this was to find better ways to use the vast volumes of data already collected and held within the organization to help reduce fraud and error throughout the health service.

The NHSBSA needed a new, centralized solution that would enable it to gain better value from its data which is spread across a disparate set of IT systems, data, storage, and analytical capabilities. To achieve this, it chose an end-to-end Oracle solution including Oracle Advanced AnalyticsOracle Exadata Database MachineOracle Exalytics In-Memory MachineOracle Endeca Information Discovery, and Oracle Business Intelligence Enterprise Edition.

With this Oracle solution, the NHSBSA established its Data Analytics Learning Laboratory (DALL), investing in both technology and expertise to create insight from its data. Within the first three months of operation, the organization identified circa GBP100 million (US$156 million) in potential savings.

Uncovering Savings in Dentistry

A word from NHS Business Services Authority

  • “Oracle Advanced Analytics’ data mining capabilities and Oracle Exalytics’ performance really impressed us. The overall solution is very fast, and our investment very quickly provided value. We can now do so much more with our data, resulting in significant savings for the NHS as a whole.” – Nina Monckton, Head of Information Services, NHS Business Services Authority

The NHSBSA used analytics to identify significant savings within NHS dental services and find instances of activities which do not demonstrate good value for money.

“With Oracle Advanced Analytics, it is much easier to detect anomalies in behaviors. We used anomaly detection to discover where there might be evidence of inappropriate behavior in dentists’ claims, enabling NHS commissioners to follow up and challenge their activities,” explained Nina Monckton, head of information services, NHSBSA.

Preventing Fraud for European Health Insurance Card

The EHIC is available to all European citizens covered by a statutory social security scheme and entitles them to free healthcare while visiting other European countries. 

During analysis of EHIC data, the NHSBSA discovered commercial addresses being used fraudulently to apply for EHIC cards and uncovered the use of invalid NHS and National Insurance numbers to apply for a card. 

“We used Oracle Exalytics and Oracle Business Intelligence for the EHIC application to improve the front-end validation process, prevent fraud, and blacklist addresses showing suspicious activities,” Monckton said.

Analyzing Billions of Records in Minutes

The NHSBSA receives data relating to more than one billion prescription items dispensed in primary care settings each year. Previously, the NHSBSA did not have the computing power to analyze this data at transaction level.

The NHSBSA can now analyze billions of records at one time, and by analyzing much larger sets of patient data, the NHSBSA can provide insight that is helping to improve standards of care throughout the health service.

“Previously, our information analysts did not have the ability to directly query data as it was mainly held in live operational systems. Now that we are able to transfer data to our Exadata environment, we have dramatically improved our ability to deliver value from our data,” Monckton said.

Analyzing Unstructured Text to Measure Satisfaction

Improving Data Matching To Save Millions of Dollars

In England, some people are entitled to free medical prescriptions or dental treatment from the NHS. The NHSBSA works with the Department of Work and Pensions (DWP) to establish that those patients declaring that they are exempt from a charge for dental treatment and/or medical prescriptions are claiming correctly. Using Oracle Exalytics to compare datasets, the NHSBSA reduced the rate of non-matching records for dentistry from 15% to just 5%.

The Role of Data Governance

Data is now moving to the heart of all NHSBSA programs. As a result of the organization’s new analytics capability, teams have a better understanding of what they can do with the data and are more careful about what data they collect. 

“We now know that if we collect the right data at the start of a program, we can measure what is working down the line. We are starting to change the culture of the organization around our data governance. There has been a massive shift. Data is now central to all our new programs, and data governance is at the heart of everything we do,” Monckton said.

Using the Data Analytics Learning Laboratory to Achieve Strategic Goals

The NHSBSA’s data analytics investment is helping the organization to achieve its 5 year strategic goals, which include helping to save GBP1 billion (US$1.56 billion) for NHS patients, reducing unit costs by 50%, improving service and delivering great results for customers, and deriving insight from data to drive change.

“With our newly established Data Lab in place, we can add even more value to the NHS. I cannot begin to describe how significant that has been. This project is really helping us to achieve our strategic goals. In addition, we are working in a different way now and it has even helped with how people interact and function in the workplace.

“We’ve had a very positive response, and our chief executive is extremely impressed with our achievements and the results we have shown so far. As a result, management is recommending that our suppliers and partners come to see what we are doing to learn from our experiences,” Monckton said.

Over the next six months, the DALL team has a large number of analytics projects in the pipeline and is looking to help other areas of the business to better leverage their data. The organization will focus on how it can use Oracle Business Intelligence Enterprise Edition with business users. In addition, the NHSBSA is investigating how it might share data and its analytical ability with other government organizations to drive further value from its investment.


  • Use new insight gathered from data to help identify cost savings and meet NHSBSA strategic goals
  • Identify and prevent healthcare fraud and benefit eligibility errors to save costs
  • Leverage existing data to transform business and productivity


Oracle Product and Services

  • Identified up to GBP100 million (US$156 million) that could potentially be saved across the NHS through benefit fraud and error reduction, by deploying new analytics infrastructure
  • Identified and implemented changes to prevent fraudulent European Health Insurance Card (EHIC) applications
  • Used data matching to identify savings that can be made through the recovery of money from patients claiming exemption from charges for dental treatment or prescriptions when not eligible to do so
  • Used anomaly detection to uncover fraudulent activity where some dentists split a single course of treatment into multiple parts and presented claims for multiple treatments
  • Analyzed unstructured text to measure employee satisfaction in more detail and found a direct link between those who felt less engaged at work and those more likely to take time off sick
  • Analyzed billions of records at one time to measure longer-term patient journeys and to analyze drug prescribing patterns to improve patient care
  • Established a new Data Analytics Learning Laboratory (DALL) that uses data and analytics to drive action and significant savings for the NHS
  • Implemented Oracle Advanced Analytics, Oracle Exadata Database Machine, Oracle Exalytics In-Memory Machine, Oracle Endeca Information Discovery, and Oracle Business Intelligence Enterprise Edition to deliver fast analysis and data mining for NHS and wider government departments

Why Oracle

“We chose Oracle because the solution could cope with very large data volumes running into billions of rows and could scale as volumes increase. In addition, the Oracle solution required no IT team support to run the queries, which enables our team of data analysts to be self-sufficient. Oracle Exalytics’ in-memory capability gave us the speed we required, and Oracle’s engineered systems accelerated deployment and reduced risk.

“Working with Oracle has been a very positive experience. The team has been incredibly responsive and provided a number of experts to help us get up and running as quickly as possible. With one vendor providing the whole solution, it’s very easy for us. If we need help, we know where to go,” Monckton said.

Implementation Process

Oracle ran a proof of concept (POC) to show the speed and capability of the proposed end-to-end solution. The POC used publically available data sets for NHS prescription data. It covered 50 million prescribed items, 300 million records, and six months of data. The team concentrated on finding anomalies in the data and carrying out further analysis to understand them before presenting the findings in a clear and straightforward way.

Following the POC, Oracle worked with NHSBSA and its data center partner, Capita, to complete the implementation. During implementation, Oracle provided the NHSBSA with access to a virtual environment. This enabled the team to get some experience with the tools before completing the implementation. As such, NHSBSA was familiar and confident with using the new analytics tools from day one, saving considerable time and gaining immediate value.

NHSBSA identified which data it should use for analysis and transferred it across to its Oracle Exadata environment. To date it has transferred more than 15 billion rows of data into Oracle Exadata. The prescription services database with 14 billion rows of data is the largest exported data source using 400 gigabytes. The export took 10 hours to complete with Oracle as the source database. 

Advice from NHSBSA

  • Have a clear plan for the first six months before you begin your implementation
  • Ensure you have buy-in from key stakeholders
  • Choose easy areas to start with, so you can demonstrate positive results quickly and prove the value of the solution to others
  • Build knowledge within your team through training and Oracle events; this helps staff to think differently about the possibilities of using data
  • Get help from the experts: talk to your existing suppliers, go to analytics events, and talk to other organizations who have implemented analytics
  • It’s never too early to think about data governance and data quality: recruit a data standards manager to create data governance policies and identify data leads around the business


Friday Sep 25, 2015

Oracle Advanced Analytics at Oracle Open World 2015

While there are a lot of OOW talks that include the work “analytics” or “big data”, this is my short list of sessions, training and demos that primarily focus on Oracle Advanced Analytics. Hope to see you there!


Oracle Advanced Analytics at OOW'15 Highlights

Big Data Analytics with Oracle Advanced Analytics12c and Big Data SQL &
Fiserv Case Study: Fraud Detection in Online Payments [CON8743]

Tuesday, Oct 27, 5:15 p.m. | Moscone South—307

· Charles Berger, Sr. Director of Product Management, Advanced Analytics and Data Mining, Oracle

· Miguel M Barrera, Director of Risk Analytics and Strategy

· Julia Minkowski, Risk Analytics Manager

Oracle Advanced Analytics 12c delivers parallelized in-database implementations of data mining algorithms and integration with R. Data analysts use Oracle Data Miner GUI and R to build and evaluate predictive models and leverage R packages and graphs. Application developers deploy Oracle Advanced Analytics models using SQL data mining functions and R. Oracle extends Oracle Database to an analytical platform that mines more data and data types, eliminates data movement, and preserves security to automatically detect patterns, anticipate customer behavior, and deliver actionable insights. Oracle Big Data SQL adds new big data sources and Oracle R Advanced Analytics for Hadoop provides algorithms that run on Hadoop. 

Fiserv manage risk for $30B+ in transfers, servicing 2,500+ US financial institutions, including 27 of the top 30 banks and prevents $200M in fraud losses every year.  When dealing with potential fraud, reaction needs to be fast.  Fiserv describes their use of Oracle Advanced Analytics for fraud prevention in online payments and shares their best practices and results from turning predictive models into actionable intelligence and next generation strategies for risk mitigation.  
Conference Session

OAA Demo Pod (#3581—Big Data Predictive Analytics with Oracle Advanced Analytics, R, and Oracle Big Data SQL   Moscone South

The Oracle Advanced Analytics database option embeds powerful data mining algorithms in Oracle Database’s SQL kernel and adds integration with R for solving big data problems such as predicting customer behavior, anticipating churn, detecting fraud, and performing market basket analysis. Data analysts work directly with database data, using the Oracle Data Miner workflow GUI (SQL Developer 4.1 ext.), SQL, or R languages and can extend Oracle Advanced Analytics’ functionally with R graphics and CRAN packages. Oracle Big Data SQL enables Oracle Advanced Analytics models to run on Oracle Big Data Appliance. Oracle R Advanced Analytics for Hadoop provides a powerful R interface over Hadoop and Spark with parallel-distributed predictive algorithms. Learn more in this demo.

Real Business Value from Big Data and Advanced Analytics [UGF4519]

Sunday, Oct 25, 3:30 p.m. | Moscone South—301

· Antony Heljula, Technical Director, Peak Indicators Limited

· Brendan Tierney, Principal Consultant, Oralytics

Attend this session to hear real case studies where big data and advanced analytics have delivered significant return on investment to a variety of Oracle customers. These solutions can pay for themselves within one year. Customer case studies include predicting which employees are likely to leave within the next 12 months, predicting which sales outlets are likely to suffer from out-of-stock products, predicting sales based on the weather forecast, and predicting which students are likely to withdraw early from their courses. A live demonstration illustrates the high-level process for implementing predictive business intelligence (BI) and its best practices.  User Group Forum Session

Customer Panel: Big Data and Data Warehousing [CON8741]

Wednesday, Oct 28, 4:15 p.m. | Moscone South—301

· Craig Fryar, Head of Wargaming Business Intelligence,

· Manuel Martin Marquez, Senior Research Fellow and Data Scientist, Cern Organisation Européenne Pour La Recherche Nucléaire

· Jake Ruttenburg, Senior Manager, Digital Analytics, Starbucks

· Chris Wones, Chief Enterprise Architect, 8451

· Reiner Zimmermann, Senior Director, DW & Big Data Global Leaders Program, Oracle

In this session, hear how customers around the world are solving cutting-edge analytical business problems using Oracle Data Warehouse and big data technology. Understand the benefits of using these technologies together, and how software and hardware combined can save money and increase productivity. Learn how these customers are using Oracle Big Data Appliance, Oracle Exadata, Oracle Exalytics, Oracle Database In-Memory 12c, or Oracle Analytics to drive their business, make the right decisions, and find hidden information. The conversation is wide-ranging, with customer panelists from a variety of industries discussing business benefits, technical architectures, implementation of best practices, and future directions.  Conference Session

End-to-End Analytics Across Big Data and Data Warehouse for Data Monetization [CON3296]

Monday, Oct 26, 4:00 p.m. | Moscone West—2022

· Satya Bhamidipati, Senior Principal Advanced Analytics Market Dev, Business Analytics Product Group, Oracle

· Gokula Mishra, VP, Big Data & Advanced Analytics, Oracle

Organizations have used data warehouses to manage structured and operational data, which provides business analysts with the ability to analyze key internal data and spot trends. However, the explosion of newer data sources (big data) not only challenges the role of the traditional data warehouse in analyzing data from these diverse sources but also exposes limitations posed by traditional software and hardware platforms. This newer data can be combined with the data in the data warehouse and analyzed without creating another data silo and creating a hybrid data analytics structure. This presentation discusses the data and analytics platform architecture that enables this data monetization and presents various industry use cases.  Conference Session

Building Predictive Models for Identifying and Preventing Tax Fraud [CON3294]

Wednesday, Oct 28, 9:00 a.m. | Park Central—Concordia

· Brian Bequette, Managing Partner, TPS

· Satya Bhamidipati, Senior Principal Advanced Analytics Market Dev, Business Analytics Product Group, Oracle

According to a TIGTA Audit Report issued in February 2013, in 2012 alone, the IRS identified almost 642,000 incidents of identity theft affecting tax administration, a 38 percent increase since 2010. And this number continues to increase. Tax Processing Systems (TPS) consultants have focused on fraud detection and developed innovative solutions and proprietary algorithms for detecting fraud. In 2012, TPS formed a partnership with Oracle and has adapted its cloud-based methodologies and algorithms for use on the Oracle technology stack. Together, TPS and Oracle have created an end-to-end fraud detection solution that is effective, efficient, and accurate. This presentation focuses on the technology and the algorithms they have developed to detect fraud.  Conference Session

Oracle University Pre-OOW Course – Sunday, Oct. 25th

Using Data Mining Techniques for Predictive Analysis Course, Sunday October 25th

This session teaches students the basic concepts of data mining and how to leverage the predictive analytical power of data mining with Oracle Database by using Oracle Data Miner 12c. Students will learn how to explore the data graphically, build and evaluate multiple data models, apply data mining models to new data, and deploy data mining's predications and insights throughout the enterprise. All this can be performed on the data in Oracle Database on a real-time basis by using Oracle Data Miner SQL APIs. As the data, models, and results remain in Oracle Database, data movement is eliminated, security is maximized, and information latency is minimized.
See Oracle University at Oracle OpenWorld and Make the Most of Your Oracle OpenWorld and JavaOne Experience with Preconference Training by Oracle Experts

When: Sunday, October 25, 2015, 9 a.m.-4 p.m., with a one-hour lunch break
Where: Golden Gate University, 536 Mission Street, San Francisco, CA 94105 (three blocks from Moscone Center)
Cost: US$850 for a full day of training (cost includes light refreshments and a boxed lunch)

Instructor: Ashwin Agarwal… Read full bio

Target Audience: Data scientists, application developers, and data analysts

Course Objectives:

  • Understand the basic concepts and describe the primary terminology of data mining
  • Understand the steps associated with a data mining process
  • Use Oracle Data Miner 12c to perform data mining
  • Understand the options for deploying data mining predictive results

Course Topics:

  • Understanding the Data Mining Concepts
  • Understanding the Benefits of Predictive Analysis
  • Understanding Data Mining Tasks
  • Key Steps of a Data Mining Process (Includes Demo)
  • Using Oracle Data Miner to Build, Evaluate, and Apply Multiple Data Mining Models Includes Demo)
  • Using Data Mining Predictions and Insights to Address Various Business Problems (Includes Demo)
  • Predicting Individual Behavior (Includes Demo)
  • Predicting Values (Includes Demo)
  • Finding Co-Occurring Events (Includes Demo)
  • Detecting Anomalies (Includes Demo)
  • Learning How to Deploy Data Mining Results for Real-Time Access by End Users

Prerequisites: A working knowledge of the SQL language and Oracle Database design and administration

Also, on the Big Data + Analytics related products OTN pages, there is a “Must See” Program Guide. Clicking on the .pdf link you’ll see the full list.

Friday Aug 07, 2015

Oracle Advanced Analytics Oracle University (OU) Classes in Cambridge, MA. September 28-Oct. 1, 2015

Oracle University has rescheduled their 2 day back to back Oracle Advanced Analytics OU Classes in Cambridge, MA.   Please help spread the word. 

Oracle Advanced Analytics combo-course (ODM + ORE) training

This is great opportunity for big data analytics customers and partners to learn hands on about using Oracle Advanced Analytics.  Vlamis, authorized OU instructor(s), will be teaching the OAA/ODM & OAA/ORE courses again and have been a great and knowledgeable OAA training and implementation partner. The courses are also during the week of Predictive Analytics World in Boston (Oracle will be exhibiting and speaking) so perhaps a good time for customers to come to Boston, perhaps use some OU credits, learn some new skills and focus on Oracle’s predictive analytics. 

Anyone (customers and Oracle Employees) can register through us at or via their normal OU connections. They should be able to utilize OU training credits for either course.  Oracle Employees should register through the Employee Self Service from Self Service Applications

Please forward to any appropriate Oracle Advanced Analytics customers and partners.  Thanks!


Sunday Jul 26, 2015

Big Data Analytics with Oracle Advanced Analytics: Making Big Data and Analytics Simple white paper

Big Data Analytics with Oracle Advanced Analytics:

Making Big Data and Analytics Simple

Oracle White Paper  |  July 2014 

Executive Summary:  Big Data Analytics with Oracle Advanced Analytics

(Click HERE to read entire Oracle white paper)   (Click HERE to watch YouTube video)

The era of “big data” and the “cloud” are driving companies to change.  Just to keep pace, they must learn new skills and implement new practices that leverage those new data sources and technologies.  Increasing customer expectations from sharing their digital exhaust with corporations in exchange for improved customer interactions and greater perceived value are pushing companies forward.  Big data and analytics offer the promise to satisfy these new requirements.  Cloud, competition, big data analytics and next-generation “predictive” applications are driving companies towards achieving new goals of delivering improved “actionable insights” and better outcomes.  Traditional BI & Analytics approaches don’t deliver these detailed predictive insights and simply can’t satisfy the emerging customer expectations in this new world order created by big data and the cloud.

Unfortunately, with big data, as the data grows and expands in the three V’s; velocity, volume and variety (data types), new problems emerge.  Data volumes grow and data becomes unmanageable and immovable.  Scalability, security, and information latency become new issues.  Dealing with unstructured data, sensor data and spatial data all introduce new data type complexities.  

Traditional advanced analytics has several information technology inherent weak points: data extracts and data movement, data duplication resulting in no single-source of truth, data security exposures, separate and many times, depending on the skills of the data analysts/scientists involved, multiple analytical tools (commercial and open source) and languages (SAS, R, SQL, Python, SPSS, etc.).  Problems become particularly egregious during a deployment phase when the worlds of data analysis and information management collide.   

Traditional data analysis typically starts with a representative sample or subset of the data that is exported to separate analytical servers and tools (SAS, R, Python, SPSS, etc.) that have been especially designed for statisticians and data scientists to analyze data.  The analytics they perform range from simple descriptive statistical analysis to advanced, predictive and prescriptive analytics.  If a data scientist builds a predictive model that is determined to be useful and valuable, then IT needs to be involved to figure out deployment and enterprise deployment and application integration issues become the next big challenge. The predictive model(s)—and all its associated data preparation and transformation steps—have to be somehow translated to SQL and recreated inside the database in order to apply the models and make predictions on the larger datasets maintained inside the data warehouse.  This model translation phase introduces tedious, time consuming and expensive manual coding steps from the original statistical language (SAS, R, and Python) into SQL.  DBAs and IT must somehow “productionize” these separate statistical models inside the database and/or data warehouse for distribution throughout the enterprise.  Some vendors will charge for specialized products and options for just for predictive model deployment.  This is where many advanced analytics projects fail.  Add Hadoop, sensor data, tweets, and expanding big data reservoirs and the entire “data to actionable insights” process becomes more challenging.  

Not with Oracle.  Oracle delivers a big data and analytics platform that eliminates the traditional extract, move, load, analyze, export, move load paradigm.  With Oracle Database 12c and the Oracle Advanced Analytics Option, big data management and big data analytics are designed into the data management platform from the beginning.  Oracle’s multiple decades of R&D investment in developing the industry’s leading data management platform, Oracle SQL, Big Data SQL, Oracle Exadata, Oracle Big Data Appliance and integration with open source R are seamlessly combined and integrated into a single platform—the Oracle Database.  

Oracle’s vision is a big data and analytic platform for the era of big data and cloud to:

  • Make big data and analytics simple (for any data size, on any computer infrastructure and any variety of data, in any combination) and

  • Make big data and analytics deployment simple (as a service, as a platform, as an application)

Oracle Advanced Analytics offers a wide library of powerful in-database algorithms and integration with open source R that together can solve a wide variety of business problems and can be accessed via SQL, R or GUI.  Oracle Advanced Analytics, an option to the Oracle Database Enterprise Edition 12c, extends the database into an enterprise-wide analytical platform for data-driven problems such as churn prediction, customer segmentation, fraud and anomaly detection, identifying cross-sell and up-sell opportunities, market basket analysis, and text mining and sentiment analysis.  Oracle Advanced Analytics empowers data analyst, data scientists and business analysts to more extract knowledge, discover new insights and make informed predictions—working directly with large data volumes in the Oracle Database.   

Data analysts/scientists have choice and flexibility in how they interact with Oracle Advanced Analytics.  Oracle Data Miner is an Oracle SQL Developer extension designed for data analysts that provides an easy to use “drag and drop” workflow GUI to the Oracle Advanced Analytics SQL data mining functions (Oracle Data Mining).  Oracle SQL Developer is a free integrated development environment that simplifies the development and management of Oracle Database in both traditional and Cloud deployments. When Oracle Data Miner users are satisfied with their analytical methodologies, they can share their workflows with other analysts and/or generate SQL scripts to hand to their DBAs to accelerate model deployment.  Oracle Data Miner also provides a PL/SQL API for workflow scheduling and automation.  

R programmers and data scientists can use the familiar open source R statistical programming language console, RStudio or any IDE to work directly with data inside the database and leverage Oracle Advanced Analytics’ R integration with the database (Oracle R Enterprise).  Oracle Advanced Analytics’ Oracle R Enterprise provides transparent SQL to R translation to equivalent SQL and Oracle Data Mining functions for in-database performance, parallelism, and scalability—this making R ready for the enterprise.  

Application developers, using the ODM SQL data mining functions and ORE R integration can build completely automated predictive analytic solutions that leverage the strengths of the database and the flexibly of R to integrate Oracle Advanced Analytics analytical solutions into BI dashboards and enterprise applications.

By integrating big data management and big data analytics into the same powerful Oracle Database 12c data management platform, Oracle eliminates data movement, reduces total cost of ownership and delivers the fastest way to deliver enterprise-wide predictive analytics solutions and applications.  

(Click HERE to read entire Oracle white paper)

Friday Jul 24, 2015

2015 BIWA SIG Virtual Conference - Two Days of "Live" Talks by Experts - FREE

2015 BIWA SIG Virtual Conference

July 30-31, 2015 9:00 a.m. - 1:00 p.m. CDT

Join us for two full days where you will hear about the latest Business Intelligence trends. 

Day One:

  • 9:00 a.m. - 10:00 a.m.: What’s new in Oracle EPM and BI Infrastructure - Eric Helmer, ADI Strategies

Hyperion EPM abd BI Fusion edition is a dramatic change under the covers. Corporations must consider more globalapproaches to infrastructure to maintain availability and performance while reducing footprint and cost. Technologies such as Exalytics, Oracle virtualization, cloud computing, software as a service, etc and open source operating systems (Linux) are more commonplace. Join Oracle Are Director Eric Helmer as he covers what’s new, what’s supported, and what options you have when implementing your EPM/BI project.

  • 10:00 a.m. - 11:00 a.m.Italian Ministry of Labor & Social Policy -- A Journey to Digital Government - Nicola Sandoli, ICONSULTING

The Italian Ministry of Labor and Social Policy (MLPS) is a branch of the Italian government responsible for all labormatters, including employment policies, promotions, worker protection, and social security. In its evolution towards a digital government, MLPS is streamlining and simplifying its administrative processes. MLPS has embarked on a data-driven journey to redefine business models and interactions with citizens – and optimize and transform government services. MLPS is focusing on four areas: - Information delivery: transitioning its data warehouse platform from reporting to centralizing and certifying data - Business Intelligence: monitoring activities, web publishing, and analyzing socio–political impact - Web analytics and semantic intelligence: interacting more efficiently with citizens - Job-hunting online guidance services: real time answers to young people looking for jobs MLPS is using a wide range of Oracle technologies to manage large amounts of diverse data, and apply advanced analytics, including - Oracle Exalytics for daily updates of 5TB of data - Oracle Spatial and Graph and MapViewer 11g for location intelligence capabilities - Oracle Business Intelligence for desktop and mobile reporting - Oracle Endeca Information Discovery for web analytics, data discovery, and data analysis using social and semantic intelligence - Oracle Real-Time Decisions - Oracle Service-Oriented Architecture Suite: central point for accessing and managing information made available through the Ministry web portal Cliclavoro Learn more about MLPS and its innovative platform that is delivering better information and services to their constituents.

  • 11:00 a.m. - 12:00 p.m.Exadata:  Elastic Configurations and IaaS – Private Cloud - Amit Kanda, Oracle

Customers are faced with challenges in their business, which include taking real time data driven decisions and  reducing costs.  Exadata’s extreme performance combined with Database In-Memory answer the real time data driven decisions. Elastic configurations and an updated subscription model (IaaS – Private Cloud) for Exadata  hardware and software accompanied the launch of Exadata X5–2.  This presentation will describe these updates and how customers can start small with Exadata and grow Exadata with their business – making it easier to reach business objectives.

  • 12:00 p.m. - 1:00 p.m.The State of Internet of Things (IoT) - Shyam Varan Nath, GE

The Internet of Things or IoT is poised to have a tremendous amount of impact around us. This session will look at  the industry landscape of IoT. The different flavors of IoT will be discussed with use cases from the consumer,  commercial and industrial sectors. Learn about the edge and cloud computing platforms to power the IoT solutions.  Finally, walk-thru of use-cases that show how machine/sensor data is being monetized through analytics. Such use  cases will span Aviation and other industries.

Day Two:

  • 9:00 a.m. - 10:00 a.m.: Big Data Analytics with Oracle Advanced Analytics 12c and Big Data SQL - Charlie Berger, Oracle

Oracle Advanced Analytics 12c, delivers parallelized in-database implementations of data mining algorithms andintegration with R. Data analysts use Oracle Data Miner GUI and R to build and evaluate predictive models and leverage R packages and graphs. Application developers deploy OAA models using SQL data mining functions and R. Oracle extends the Database to an analytical platform that mines more data and more data types, eliminates data movement and preserves security to automatically detect patterns and anticipate customer behavior and deliver actionable insights. Oracle Big Data SQL adds new big data sources and ORAAH provides algorithms that run on Hadoop. Come learn what’s new, best practices, and hear customer examples.

  • 10:00 a.m. - 11:00 a.m.: Graph Data Management and Analytics for Big DataBill Beauregard, Oracle & Zhe Wu, Oracle

The newest Oracle big data product, Oracle Big Data Spatial and Graph, offers a set of spatial analytic services, and a graph database with rich graph analytics that support big data workloads on Apache Hadoop and NoSQL technologies. Oracle is applying over a decade of expertise with spatial and graph analytic technologies to big data architectures. Graphs are an important data model for big data systems. Property graphs can be used for discovery, for instance, to discover underlying communities and influencers within a social graph, relationships and connections in cyber security networks, and to generate recommendations based on interests, profiles, and past behaviors. Oracle Big Data Spatial and Graph provides optimized storage, search and querying in Oracle NoSQL Database and Apache HBase for distributed property graphs. It offers 35 built-in, in-memory, parallel property graph analytic functions. We will discuss use cases, features, architecture, and show a demo. Learn how developers and data scientists can manage their most challenging graph data processing in a single enterprise-class Big Data platform.

  • 11:00 a.m. - 12:00 p.m.Why Oracle Database In-Memory?  Use Cases and Overview - Andy Rivenes, Oracle

Oracle recently announced the availability of the Oracle Database In-Memory option, a memory-optimized database technology that transparently adds real-time analytics to applications. Because the In-Memory option is 100% compatible with existing Oracle Database applications, it’s easy to integrate it into your environment and to begin reaping the benefits. But how do you get started with it? What do you need to know to take full advantage of this new functionality? This session will give an overview of what Oracle Database In-Memory is and then discuss some use cases to highlight how it can be used.

| Register Here |

Wednesday Jul 15, 2015

Call for Abstracts at BIWA Summit'16 - The Oracle Big Data + Analytics User Conference

Please email with any questions regarding the submission process.

What Successes Can You Share?

We want to hear your story. Submit your proposal today for the Oracle BIWA Summit 2016.

Proposals will be accepted through Monday evening, November 2, 2015, at midnight, EST. Don’t wait, though—we’re accepting submissions on a rolling basis, so that selected sessions can be published early on our online agenda.

To submit your abstract, click here, select a track, fill out the form.

Please note:

  • Presentations must be noncommercial.
  • Sales promotions for products or services disguised as proposals will be eliminated. 
  • Speakers whose abstracts are accepted will be expected to submit (at a later date) a PowerPoint presentation slide set. 
  • Accompanying technical and use case papers are encouraged, but not required.

Speakers whose abstracts are accepted will be given a complimentary registration to the conference. (Any additional co-presenters must register for the event separately and provide appropriate registration fees. It is up to the co-presenters’ discretion which presenter to designate for the complimentary registration.) 

This Year’s Tracks

Proposals can be submitted for the following tracks: 

More About the Conference

The Oracle BIWA Summit 2016 is organized and managed by the Oracle BIWA SIG, the Oracle Spatial SIG, and the Oracle Northern California User Group. The event attracts top BI, data warehousing, analytics, Spatial, IoT and Big Data experts.

The three-day event includes keynotes from industry experts, educational sessions, hands-on labs, and networking events.

Hot topics include: 

  • Database, data warehouse and cloud, Big Data architecture
  • Deep dives and hands-on labs on existing Oracle BI, data warehouse, and analytics products
  • Updates on the latest Oracle products and technologies (e.g. Big Data Discovery, Oracle Visual Analyzer, Oracle Big Data SQL)
  • Novel and interesting use cases on everything – Spatial, Graph, Text, Data Mining, IoT, ETL, Security, Cloud
  • Working with Big Data (e.g., Hadoop, "Internet of Things,” SQL, R, Sentiment Analysis)
  • Oracle Business Intelligence (OBIEE), Oracle Big Data Discovery, Oracle Spatial, and Oracle Advanced Analytics—Better Together

Hope to see you at BIWA'16 in January, 2016!


Wednesday Apr 22, 2015

OpenWorld 2015 Call for Proposals Extended to Wed, May 6th, 11:59 p.m

OpenWorld 2015 Call for Proposals Extended to Wed, May 6th, 11:59 p.m Submit your Oracle Advanced Analytics stories now

If you’re an Oracle technology expert, conference attendees want to hear it straight from you. So don’t wait—proposals must be submitted by April 29.

Wanted: Outstanding Oracle Experts

The Oracle OpenWorld 2015 Call for Proposals is now open. Attendees at the conference are eager to hear from experts on Oracle business and technology. They’re looking for insights and improvements they can put to use in their own jobs: exciting innovations, strategies to modernize their business, different or easier ways to implement, unique use cases, lessons learned, the best of best practices.

If you’ve got something special to share with other Oracle users and technologists, they want to hear from you, and so do we. Submit your proposal now for this opportunity to present at Oracle OpenWorld, the most important Oracle technology and business conference of the year.

We recommend you take the time to review the General Information, Submission Information, Content Program Policies, and Tips and Guidelines pages before you begin. We look forward to your submissions.

Submit Your Proposal

By submitting a session for consideration, you authorize Oracle to promote, publish, display, and disseminate the content submitted to Oracle, including your name and likeness, for use associated with the Oracle OpenWorld and JavaOne San Francisco 2015 conferences. Press, analysts, bloggers and social media users may be in attendance at OpenWorld or JavaOne sessions.

General Information

  • Conference location: San Francisco, California, USA
  • Dates: Sunday, October 25 to Thursday, October 29, 2015
  • Website: Oracle OpenWorld

Key Dates for 2015

Deliverables Due Dates
Call for Proposals—Open Wednesday, March 25
Call for Proposals—Closed Wednesday, April 29, 11:59 p.m. PDT
Notifications for accepted and declined submissions sent Mid-June

Contact us

  • For questions regarding the Call for Proposals, send an e-mail to
  • For technical questions about the submission tool or issues with submitting your proposal, send an e-mail to
  • Oracle employee submitters should contact the appropriate Oracle track leads before submitting. To view a list of track leads, click here.

Saturday Mar 28, 2015

Use Repository APIs to Manage and Schedule Workflows to run

Data Miner 4.1 ships with a set of repository PL/SQL APIs that allow applications to manage Data Miner projects and workflows directly. The workflow APIs enable applications to execute workflows immediately or schedule workflows to execute using specific time intervals or using defined schedules. The workflow run APIs internally use Oracle Scheduler for scheduling functionality. Moreover, repository views are provided for applications to query project and workflow information. Applications can also monitor workflow execution status and query generated results using these views.

With the workflow APIs, applications can seamlessly integrate the workflow running process.  Moreover, all generated results are accessible by the Data Miner, so you can view the results using the Data Miner user interface.

For more information, please read the White Paper Use Repository APIs to Manage and Schedule Workflows to run

Monday Dec 15, 2014

Use Oracle Data Miner to Perform Sentiment Analysis inside Database using Twitter Data Demo

Sentiment analysis has been a hot topic recently; sentiment analysis or opinion mining refers to the application of natural language processing, computational linguistics, and text analytics to identify and extract subjective information in source materials.  Social media websites are good source of people sentiments.  Companies have been using social networking sites to make new product announcements, promote their products, collect product reviews and user feedback, interact with their customers, etc.  It is important for companies to sense customer sentiments toward their products, so they can react accordingly to benefit from customers’ opinion.

In this blog, we will show you how to use Data Miner to perform some basic sentiment analysis (based on text analytics) using Twitter data.  The demo data was downloaded from the developer API console page of the Twitter website.  The data itself originated from the Oracle Twitter page, and it contains about a thousand tweets posted in the past six months (May to Oct 2014).  We will determine the sentiments (highly favored, moderately favored, and less favored) of tweets based on their favorite counts, and assign the sentiment to each tweet.  We then build classification models using these tweets along with their assigned sentiments.  The goal is to predict how well a new tweet will be received by customers.  This may help marketing department to better craft a tweet before it is posted.

The demo (click here to download demo twitter data and workflow) will use the newly added JSON Query node in the Data Miner 4.1 to import the twitter data; please review the “How to import JSON data to Data Miner for Mining” blog entry in previous post.

Workflow for Sentiment Analysis

The following workflow shows the process we use to prepare the twitter data, determine the sentiments of tweets, and build classification models on the data.

The following describes the nodes used in the above workflow:

  • Data Source (TWITTER_LARGE)
    • Select the demo Twitter data source.  The sample Twitter data is attached with this blog.
  • JSON Query (JSON Query)
    • Select the required JSON attributes used for analysis; we only use the “id”, “text”, and “favorite_count” attributes.  The “text” attribute contains the tweet, and the “favorite_count” attribute indicates how many times the tweet has been favorited.
  • SQL Query (Cleanse Tweets)
    • Remove shorten URLs and punctuations within tweets because these data contain no predictive information.
  • Filter Rows (Filter Rows)
    • Remove retweeted tweets because these are duplicate tweets.
  • Transform (Transform)
    • Perform quantile bin of the “favorite_count” data into three quantiles; each quantile represent a sentiment.  The top quantile represents “highly favored” sentiment, the middle quantile represents “moderately favored” sentiment, and the bottom quantile represents “less favored” sentiment.
  • SQL Query (Recode Sentiment)
    • Assign quantiles as determined sentiments to tweets.
  • Create Table (OUTPUT_4_29)
    • Persist the data to a table for classification model build (optional).
  • Classification (Class Build)
    • Build classification models to predict customer sentiment toward a new tweet (how much will customer like this new tweet?).

Data Source Node (TWITTER_LARGE)

Select the JSON_DATA in the TWITTER_LARGE table.  The JSON_DATA contains about a thousand tweets to be used for sentiment analysis.

JSON Query Node (JSON Query)

Use the new JSON Query node to select the following JSON attributes.  This node projects the JSON data to relational data format, so that it can be consumed within the workflow process.

SQL Query Node (Cleanse Tweets)

Use the REGEXP_REPLACE function to remove numbers, punctuations, and shorten URLs inside tweets because these data are considered noises and do not provide any predictive information.  Notice we do not treat hash tags inside tweets specially; these tags are treated as regular words.

We specify the number, punctuation, and URL patterns in regular expression syntax and use the database function REGEXP_REPLACE to replace these patterns inside all tweets with empty spaces.

REGEXP_REPLACE("JSON Query_N$10055"."TWEET", '([[:digit:]*]|[[:punct:]*]|(http[s]?://(.*?)(\s|$)))', '', 1, 0) "TWEETS",
"JSON Query_N$10055"."FAVORITE_COUNT",
"JSON Query_N$10055"."ID"
"JSON Query_N$10055"

Filter Rows Node (Filter Rows)

Remove retweeted tweets because these are duplicate tweets.  Usually, retweeted tweets start with a “RT” abbreviate, so we specify the following row filter condition to filter out those tweets.

Transform Node (Transform)

Use the Transform node to perform quantile bin of the “favorite_count” data into three quantiles; each quantile represent a sentiment.  For simplicity, we just bin the count into three quantiles without applying any special treatment first.

SQL Query Node (Recode Sentiment)

Assign quantiles as determined sentiments to tweets; top quantile represents “highly favored” sentiment, the middle quantile represents “moderately favored” sentiment, and the bottom quantile represents “less favored”.  These sentiments become target classes for the classification model build.

Classification Node (Class Build)

Build Classification models using the sentiment as target and tweet id as case id.

Since the TWEETS column contains the textual tweets, so we change the mining type to Text Custom.

Enable the Stemming option for text processing.

Compare Test Results

After the model build completes successfully, open the test viewer to compare model test results, the SVM model seems to produce the best prediction for the “highly favored” sentiment (57% correct prediction).

Moreover, the SVM model has better lift result than other models, so we will use this model for scoring.

Sentiment Prediction (Scoring)

Let’s score this tweet “this is a boring tweet!” using the SVM model.

As expected, this tweet receives a “less favored” prediction.

How about this tweet “larry is doing a data mining demo now!” ?

Not surprisingly, this tweet receives a “highly favored” prediction.

Last but not least, let’s see the sentiment prediction for the title of this blog

Not bad it gets a “highly favored” prediction, so it seems this title will be well received by audience.


The best SVM model only produces 57% accuracy for the “highly favored” sentiment prediction, but it is reasonably better than random guess.  For a larger sample of tweet data, the model accuracy could be improved.  With the new JSON Query node, it enables us to perform data mining on JSON data which is the most popular data format produced by prominent social networking sites.

Monday Dec 08, 2014

How to import JSON data to Data Miner for Mining

JSON is a popular lightweight data structure used by Big Data. Increasingly, a lot of data produced by Big Data are in JSON format. For example, web logs generated in the middle tier web servers are likely in JSON format. NoSQL database vendors have chosen JSON as their primary data representation. Moreover, the JSON format is widely used in the RESTful style Web services responses generated by most popular social media websites like Facebook, Twitter, LinkedIn, etc. This JSON data could potentially contain wealth of information that is valuable for business use. So it is important that we can bring this data over to Data Miner for analysis and mining purposes.

Oracle database provides ability to store and query JSON data. To take advantage of the database JSON support, the upcoming Data Miner 4.1 added a new JSON Query node that allows users to query JSON data as relational format. In additional, the current Data Source node and Create Table node are enhanced to allow users to specify JSON data in the input data source.

In this blog, I will show you how to specify a JSON data in the input data source and use JSON Query node to selectively query desirable attributes and project the result in relational format. Once the data is in relational format, users can treat it as a normal relational data source and start analyzing and mining it immediately. The Data Miner repository installation installs a sample JSON dataset ODMR_SALES_JSON_DATA, which I will be using it here. However, Oracle Big Data SQL supports queries against vast amounts of big data stored in multiple data sources, including Hadoop. Users can view and analyze data from various data stores together, as if it were all stored in an Oracle database.

Specify JSON Data

The Data Source node and Create Table nodes are enhanced to allow users to specify the JSON data type in the input data source.

Data Source Node

For this demo, we will focus on the Data Source node. To specify JSON data, create a new workflow with a Data Source node. In the Define Data Source wizard, select the ODMR_SALES_JSON_DATA table. Notice there is only one column (JSON_DATA) in this table, which contains the JSON data.

Click Next to go to the next step where it shows the JSON_DATA is selected with the JSON(CLOB) data type. The JSON prefix indicates the data stored is in JSON format; the CLOB is the original data type. The JSON_DATA column is defined with the new “IS JSON” constraint, which indicates only valid JSON document can be stored there. The UI can detect this constraint and automatically select the column as JSON type. If there was not a “IS JSON” constraint defined, the column would be shown with a CLOB data type. To manually designate a column as a JSON type, click on the data type itself to bring up a in-place dropdown where it lists the original data type (e.g. CLOB) and a corresponding JSON type (e.g. JSON(CLOB)), so just select the JSON type. Note: only the following data types can be set to JSON type: VARCHAR2, CLOB, BLOB, RAW, NCLOB, and NVARCHAR2.

Click Finish and run the node now.

Once the node is run successfully, open the editor to examine the generated JSON schema.

Notice the message “System Generated Data Guide is available” at the bottom of the Selected Attributes listbox. What happens here is when the Data Source node is run, it parsed the JSON documents to produce a schema that represents the document structure. Here is what the schema looks like:











































The JSON Path expression syntax and associated data type info (OBJECT, ARRAY, NUMBER, STRING, BOOLEAN, NULL) are used to represent JSON document structure. We will refer to this JSON schema as Data Guide throughout the product.

Before we look at the Data Guide in the UI, let’s look at the settings that can affect how it is generated. Click the “JSON Settings…” button to open the JSON Parsing Settings dialog.

The settings are described below:

· Generate Data Guide if necessary

o Generate a Data Guide if it is not already generated in parent node.

· Sampling

o Sample JSON documents for Data Guide generation.

· Max. number of documents

o Specify maximum number of JSON documents to be parsed for Data Guide generation.

· Limit Document Values to Process

o Sample JSON document values for Data Guide generation.

· Max. number per document

o Specify maximum number of JSON document scalar values (e.g. NUMBER, STRING, BOOLEAN, NULL) per document to be parsed for Data Guide generation.

The sampling option is enabled by default to prevent long-running parsing of JSON documents; parsing could take a while for large number of documents. However, users may supply a Data Guide (Import from File) or reuse an existing Data Guide (Import from Workflow) if compatible Data Guide is available.

Now let’s look at the Data Guide, go back to the Edit Data Source Node dialog, select the JSON_DATA column and click the above to open the Edit Data Guide dialog. The dialog shows the JSON structure in a hierarchical tree view with data type information. The “Number of Values Processed” shows the total number of JSON scalar values was parsed to produce the Data Guide.

Users can control whether to enable Data Guide generation or import a compatible Data Guide via the menu under the icon.

The menu options are described below:

· Default

o Use the “Generate Data Guide if necessary” setting found in the JSON Parsing Setting dialog (see above).

· On

o Always generate a Data Guide.

· Off

o Do not generate a Data Guide.

· Import From Workflow

o Import a compatible Data Guide from a workflow node (e.g. Data Source, Create Table). The option will be set to Off after the import (disable Data Guide generation).

· Import From File

o Import a compatible Data Guide from a file. The option will be set to Off after the import (disable Data Guide generation).

Users can also export the current Data Guide to a file via the icon.

Select JSON Data

In Data Miner 4.1, a new JSON Query node is added to allow users to selectively bring over desirable JSON attributes as relational format.

JSON Query Node

The JSON Query node is added to the Transforms group of the Workflow.

Let’s create a JSON Query node and connect the Data Source node to it.

Double click the JSON Query node to open the editor. The editor consists of four tabs, and these tabs are described as followings:


The Column dropdown lists all available columns in the data source where JSON structure (Data Guide) is found. It consists of the following two sub tabs:

o Structure

o Show the JSON structure of the selected column in a hierarchical tree view.

o Data

o Show sample of JSON documents found in the selected column. By default it displays first 2,000 characters (including spaces) of the documents. Users can change the sample size (max. 50,000 chars) and run the query to see more of the documents.

· Addition output

o Allow users to select any non-JSON columns in the data source as additional output columns.

· Aggregation

o Allow users to define aggregations of JSON attributes.

· Preview

o Output Columns

o Show columns in the generated relational output.

o Output Data

o Show data in the generated relational output.


Let’s select some JSON attributes to bring over. Skip the SALES attributes because we want to define aggregations for these attributes (QUANTITY_SOLD and AMOUNT_SOLD).

To peek at the JSON documents, go to the Data tab. You can change the Sample Size to look at more JSON data. Also, you can search for specific data within the displayed documents by using the search control.

Addition Output Tab

If you have any non-JSON columns in the data source that you want to carry over for output, you can select those columns here.

Aggregate Tab

Let’s define aggregations (use SUM function) for QUANTITY_SOLD and AMOUNT_SOLD attributes (within the SALES array) for each customer group (group by CUST_ID).

Click the icon in the top toolbar to open the Edit Group By dialog, where you can select the CUST_ID as the Group-By attribute. Notice the Group-By attribute can consists of multiple attributes.

Click OK to return to the Aggregate tab, where you can see the selected CUST_ID Group-By attribute is now added to the Group By Attributes table at the top.

Click the icon in the bottom toolbar to open the Add Aggregations dialog, where you can define the aggregations for both QUANTITY_SOLD and AMOUNT_SOLD attributes using the SUM function.

Next, click the icon in the toolbar to open the Edit Sub Group By dialog, where you can specify a Sub-Group By attribute (PROD_ID) to calculate quantity sold and amount sold per product per customer.

Specifying a Sub-Group By column creates a nested table; the nested table contains columns with data type DM_NESTED_NUMERICALS.

Click OK to return to the Aggregate tab, where you can see the defined aggregations are now added to the Aggregation table at the bottom.

Preview Tab

Let’s go to the Preview tab to look at the generated relational output. The Output Columns tab shows all output columns and their corresponding source JSON attributes. The output columns can be renamed by using the in-place edit control.

The Output Data tab shows the actual data in the generated relational output.

Click OK to close the editor when you are done. The generated relational output is single-record case format; each row represents a case. If we had not defined the aggregations for the JSON array attributes, the relational output would have been in multiple-record case format. The multiple-record case format is not suitable for building mining models except for Association model (which accepts transactional data format with transaction id and item id).

Use Case

Here is an example of how JSON Query node is used to project the JSON data source to relational format, so that the data can be consumed by Explore Data node for data analysis and Class Build node for building models.


This blog shows how JSON data can be brought over to Data Miner via the new JSON Query node. Once the data is projected to relational format, it can easily be consumed by Data Miner for graphing, data analysis, text processing, transformation, and modeling.

Thursday Nov 20, 2014


Please share with your Oracle BI, DW, Analytics, big Data and Spatial User coMMUNITY.   THANKS.  CB

BIWA Summit’15 Jan 27-29, 2015 Early Bird Registration Ends Friday. 

Registration is now LIVE. Register by November 21st (tomorrow) to receive the early bird pricing of $249 and save $50.

Please direct your colleagues to REGISTER NOW and participate to take advantage of the Early Bird registration ($249.00 USD).  EARLY BIRD SPECIAL ENDS TOMORROW (Friday, Nov. 21).  Here’s some information about the event below and some pics and talks from last year to give some feel for the opportunity.   

BIWA Summits have been organized and managed by the Oracle BI, DW and Analytics SIG user community of IOUG (Independent Oracle User Group) and attract the top Oracle BI, DW and Advanced Analytics and Big Data experts. The 2.5-day BIWA Summit'15 event joins forces with the Oracle Spatial SIG and involves Keynotes by Industry experts, Educational sessions, Hands-on Labs and networking events. We have a great line up so far w/ Tom Kyte Senior Technical Architect in Oracles Server Technology, Doug Cutting (Chief Architect, Cloudera), Oracle BI Senior Management, Neil Mendelson, VP of Product Management Big Data and Advanced Analytics, Matt Bradley, SVP, Oracle Product Development, EPM Applications, other features speakers, and many customers/tech experts (see web site and search % Sessions). Our BIWA Summit offers a broad, multi-track user driven conference that has built up a growing reputation over the years. We emphasize technical content and networking with like minded customers, users, developers, product managers (Database, Big Data Appliance, Oracle Advanced Analytics, Spatial, OBIEE, Endeca, Big Data Discovery, In-Memory, SQL Patterns, etc.), etc. who all share an interest in “novel and interesting use cases” of Oracle BI, DW, Advanced Analytics and Spatial technologies, applications and solutions. We’re off to a great start this year with a great agenda and hope to pack the HQ CC this Jan 27-29, 2015 with 300+ attendees.

Please forward and share with your Oracle BI, DW, Analytics, Big Data and Spatial colleagues.   

Thank you!  Hope to see you at BIWA Summit'15


Wednesday Oct 08, 2014

2014 was a very good year for Oracle Advanced Analytics at Oracle Open World 2014

2014 was a very good year for Oracle Advanced Analytics at Oracle Open World 2014.   We had a number of customer, partner and Oracle talks that focused on the Oracle Advanced Analytics Database Option.    See below with links to presentations.  Check back later to OOW Sessions Content Catalog as not all presentations have been uploaded yet.  :-(

Big Data and Predictive Analytics: Fiserv Data Mining Case Study [CON8631]

Moving data mining algorithms to run as native data mining SQL functions eliminates data movement, automates knowledge discovery, and accelerates the transformation of large-scale data to actionable insights from days/weeks to minutes/hours. In this session, Fiserv, a leading global provider of electronic commerce systems for the financial services industry, shares best practices for turning in-database predictive models into actionable policies and illustrates the use of Oracle Data Miner for fraud prevention in online payments. Attendees will learn how businesses that implement predictive analytics in their production processes significantly improve profitability and maximize their ROI.

Developing Relevant Dining Visits with Oracle Advanced Analytics at Olive Garden [CON2898]

Olive Garden, traditionally managing its 830 restaurants nationally, transitioned to a localized approach with the help of predictive analytics. Using k-means clustering and logistic classification algorithms, it divided its stores into five behavioral segments. The analysis leveraged Oracle SQL Developer 4.0 and Oracle R Enterprise 1.3 to evaluate 115 million transactions in just 5 percent the time required by the company’s BI tool. While saving both time and money by making it possible to develop the solution internally, this analysis has informed Olive Garden’s latest remodel campaign and continues to uncover millions in profits by optimizing pricing and menu assortment. This session illustrates how Oracle Advanced Analytics solutions directly affect the bottom line.

A Perfect Storm: Oracle Big Data Science for Enterprise R and SAS Users [CON8331]

With the advent of R and a rich ecosystem of users and developers, a myriad of bloggers, and thousands of packages with functionality ranging from social network analysis and spatial data analysis to empirical finance and phylogenetics, use of R is on a steep uptrend. With new R tools from Oracle, including Oracle R Enterprise, Oracle R Distribution, and Oracle R Advanced Analytics for Hadoop, users can scale and integrate R for their enterprise big data needs. Come to this session to learn about Oracle’s R technologies and what data scientists from smart companies around the world are doing with R.

Extending the Power of In-Database Analytics with Oracle Big Data Appliance [CON2452]

The need for speed could not be greater—not speed of processing but time to market. The problem is driven by the long journey data takes before evolving into insight. Insight, however, is always relative to assumption. In fact, analytics is often seen as a battle between assumption and data. Assumptions can be classified into three types: related to distributions, ratios, and relations. In this session, you will see how the most-valuable business insights can come in the matter of hours, not months, when assumptions are challenged with data. This is made possible by the integration of Oracle Big Data Appliance, enabling transparent access to in-database analytics from the data warehouse and avoiding the traditional long journey of data to insight.

Market Basket Analysis at Dunkin’ Brands [CON6545]

With almost 120 years of franchising experience, Dunkin’ Brands owns two of the world’s most recognized, beloved franchises: Dunkin’ Donuts and Baskin-Robbins. This session describes a market basket analysis solution built from scratch on the Oracle Advanced Analytics platform at Dunkin’ Brands. This solution enables Dunkin’ to look at product affinity and a host of associated sales metrics with a view to improving promotional effectiveness and cross-sell/up-sell to increase customer loyalty. The presentation discusses the business value achieved and technical challenges faced in scaling the solution to Dunkin’ Brands’ transaction volumes, including engineered systems (Oracle Exadata) hardware and parallel processing at the core of the implementation.

Predictive Analytics with Oracle Data Mining [CON8596]

This session presents three case studies related to predictive analytics with the Oracle Data Mining feature of Oracle Advanced Analytics. Service contracts cancellation avoidance with Oracle Data Mining is about predicting the contracts at risk of cancellation at least nine months in advance. Predicting hardware opportunities that have a high likelihood of being won means identifying such opportunities at least four months in advance to provide visibility into suppliers of required materials. Finally, predicting cloud customer churn involves identifying the customers that are not as likely to renew subscriptions as others.

SQL Is the Best Development Language for Big Data [CON7439]

SQL has a long and storied history. From the early 1980s till today, data processing has been dominated by this language. It has changed and evolved greatly over time, gaining features such as analytic windowing functions, model clauses, and row-pattern matching. This session explores what's new in SQL and Oracle Database for exploiting big data. You'll see how to use SQL to efficiently and effectively process data that is not stored directly in Oracle Database.

Advanced Predictive Analytics for Database Developers on Oracle [CON7977]

Traditional database applications use SQL queries to filter, aggregate, and summarize data. This is called descriptive analytics. The next level is predictive analytics, where hidden patterns are discovered to answer questions that give unique insights that cannot be derived with descriptive analytics. Businesses are increasingly using machine learning techniques to perform predictive analytics, which helps them better understand past data, predict future trends, and enable better decision-making. This session discusses how to use machine learning algorithms such as regression, classification, and clustering to solve a few selected business use cases.

What Are They Thinking? With Oracle Application Express and Oracle Data Miner [UGF2861]

Have you ever wanted to add some data science to your Oracle Application Express applications? This session shows you how you can combine predictive analytics from Oracle Data Miner into your Oracle Application Express application to monitor sentiment analysis. Using Oracle Data Miner features, you can build data mining models of your data and apply them to your new data. The presentation uses Twitter feeds from conference events to demonstrate how this data can be fed into your Oracle Application Express application and how you can monitor sentiment with the native SQL and PL/SQL functions of Oracle Data Miner. Oracle Application Express comes with several graphical techniques, and the presentation uses them to create a sentiment dashboard.

Transforming Customer Experience with Big Data and Predictive Analytics [CON8148]

Delivering a high-quality customer experience is essential for long-term profitability and customer retention in the communications industry. Although service providers own a wealth of customer data within their systems, the sheer volume and complexity of the data structures inhibit their ability to extract the full value of the information. To change this situation, service providers are increasingly turning to a new generation of business intelligence tools. This session begins by discussing the key market challenges for business analytics and continues by exploring Oracle’s approach to meeting these challenges, including the use of predictive analytics, big data, and social network analytics.

There are a few others where Oracle Advanced Analytics is included e.g. Retail GBU, Big Data Strategy, etc. but they are typically more broadly focused.  If you search the Content Catalog for “Advanced Analytics” etc. you can find other related presentations that involve OAA.

Hope this helps.  Enjoy!


Wednesday Aug 06, 2014

New Book: Predictive Analytics Using Oracle Data Miner

Great New Book Now Available:  Predictive Analytics Using Oracle Data Miner, by Brendan Tierney, Oracle ACE Director

If you have an Oracle Database and want to leverage that data to discover new insights, make predictions and generate actionable insights, this book is a must read for you!  In Predictive Analytics Using Oracle Data Miner: Develop & Use Oracle Data Mining Models in Oracle Data Miner, SQL & PL/SQL, Brendan Tierney, Oracle ACE Director and data mining expert, guides the user through the basic concepts of data mining and offers step by step instructions for solving data-driven problems using SQL Developer’s Oracle Data Mining extension.  Brendan takes it full circle by showing the reader how to deploy advanced analytical methodologies and predictive models immediately into enterprise-wide production environments using the in-database SQL and PL/SQL functionality.  

Definitely a must read for any Oracle data professional!

See Predictive Analytics Using Oracle Data Miner, by Brendan Tierney on  

Sunday May 18, 2014

Oracle Data Miner and Oracle R Enterprise Integration - Watch Demo

Oracle Data Miner and Oracle R Enterprise Integration - Watch Demo

Oracle Advanced Analytics (Database EE) Option turns the database into an enterprise-wide analytical platform that can quickly deliver enterprise-wide predictive analytics and actionable insights.  Oracle Advanced Analytics is comprised of both the Oracle Data Mining SQL data mining functions, Oracle Data Miner, an extension to SQL Developer that exposes the data mining SQL functions for data analysts, and Oracle R Enterprise which integrates the R statistical programming language with SQL.  15 powerful in-database SQL data mining functions, the SQL Developer/Oracle Data Miner workflow GUI and the ability to integrate open source R within an analytical methodology, makes the Oracle Database + Oracle Advanced Analytics Option the ideal platform for building and deploying enterprise-wide predictive analytics applications/solutions.  

In Oracle Data Miner 4.0 we added a new SQL Query node to allow users to insert arbitrary SQL scripts within an ODMr analytical workflow. Additionally, the SQL Query node allows users to leverage registered R scripts to extend Oracle Data Miner's analytical capabilities.  For applications that are mostly OAA/Oracle Data Mining SQL data mining functions based but require additional analytical techniques found in the R community, this is an ideal method for integrating the power of in-database SQL analytical and data mining functions with the flexibility of open source R.  For applications that are built entirely using the R statistical programming language, it may be more practical to stay within the R console or RStudio environments, but for SQL-centric in-database predictive methodologies, this integration is just what might satisfy your needs.

Watch this Oracle Data Miner and Oracle R Enteprise Integration YouTube to see the demo. 

There is an excellent related Oracle Data Miner:  Integrate Oracle R Enterprise Algorithms into workflow using the SQL Query node (pdf, companion files) white paper on this topic that includes examples on the Oracle Technology Network in the Oracle Data Mining pages.  

Tuesday May 06, 2014

Oracle Data Miner 4.0/SQLDEV 4.0 New Features - Watch Demo!

Oracle Data Miner 4.0 New Features 

Oracle Data Miner/SQLDEV 4.0 (for Oracle Database 11g and 12c)

  • New Graph node (box, scatter, bar, histograms)
  • SQL Query node + integration of R scripts
  • Automatic SQL script generation for deployment

Oracle Advanced Analytics 12c New SQL data mining algorithms/enhancements features exposed in Oracle Data Miner 4.0

  • Expectation Maximization Clustering algorithm
  • PCA & Singular Vector Decomposition algorithms
  • Decision Trees can also now mine unstructured data
  • Improved/automated Text Mining, Prediction Details and other algorithm improvements
  • SQL Predictive Queries—automatic build, apply within simple yet powerful SQL query

Tuesday Mar 18, 2014

Deploy Data Miner Apply Node SQL as RESTful Web Service for Real-Time Scoring

The free Oracle Data Miner GUI is an extension to Oracle SQL Developer that enables data analysts to work directly with data inside the database, explore the data graphically, build and evaluate multiple data mining models, apply Oracle Data Mining models to new data and deploy Oracle Data Mining's predictions and insights throughout the enterprise. The product enables a complete workflow deployment to a production system via generated PL/SQL scripts (See Generate a PL/SQL script for workflow deployment). This time I want to focus on the model scoring side, especially the single record real-time scoring. Would it be nice if the scoring function can be accessed by different systems on different platforms? How about deploying the scoring function as a Web Service? This way any system that can send HTTP request can invoke the scoring Web Service, and consume the returning result as they see fit. For example, you can have a mobile app that collects customer data, and then invokes the scoring Web Service to determine how likely the customer is going to buy a life insurance. This blog shows a complete demo from building predictive models to deploying a scoring function as a Web Service. However, the demo does not take into account of any authentication and security consideration related to Web Services, which is out of the scope of this blog.

Web Services Requirement

This demo uses the Web Services feature provided by the Oracle APEX 4.2 and Oracle REST Data Services 2.0.6 (formerly Oracle APEX Listener). Here are the installation instructions for both products:

For 11g Database

Go to the Oracle Application Express Installation Guide and following the instructions below:

1.5.1 Scenario 1: Downloading from OTN and Configuring the Oracle Application Express Listener

· Step 1: Install the Oracle Database and Complete Pre-installation Tasks

· Step 2: Download and Install Oracle Application Express

· Step 3: Change the Password for the ADMIN Account

· Step 4: Configure RESTful Services

· Step 5: Restart Processes

· Step 6: Configure APEX_PUBLIC_USER Account

· Step 7: Download and Install Oracle Application Express Listener

· Step 8: Enable Network Services in Oracle Database 11g

· Step 9: Security Considerations

· Step 10: About Developing Oracle Application Express in Other Languages

· Step 11: About Managing JOB_QUEUE_PROCESSES

· Step 12: Create a Workspace and Add Oracle Application Express Users

For 12c Database

Go to Oracle Application Express Installation Guide (Release 4.2 for Oracle Database 12c) and following the instructions below:

4.4 Installing from the Database and Configuring the Oracle Application Express Listener

· Install the Oracle Database and Complete Preinstallation Tasks

· Download and Install Oracle Application Express Listener

· Configure RESTful Services

· Enable Network Services in Oracle Database 12c

· Security Considerations

· About Running Oracle Application Express in Other Languages


· Create a Workspace and Add Oracle Application Express Users

Note: The APEX is pre-installed with the Oracle database 12c, but you need to configure it in order to use it.

For this demo, create a Workspace called DATAMINER that is based on an existing user account that has already been granted access to the Data Miner (this blog assumes DMUSER is the Data Miner user account). Please refer to the Oracle By Example Tutorials to review how to create a Data Miner user account and install the Data Miner Repository. In addition, you need to create an APEX user account (for simplicity I use DMUSER).

Build Models to Predict BUY_INSURANCE

This demo uses the demo data set, INSUR_CUST_LTV_SAMPLE, that comes with the Data Miner installation. Now, let’s use the Classification Build node to build some models using the CUSTOMER_ID as the case id and BUY_INSURANCE as the target.

Evaluate the Models

Nice thing about the Build node is that it builds a set of models with different algorithms within the same mining function by default, so we can select the best model to use. Let’s look at the models in the Test Viewer; here we can compare the models by looking at their Predictive Confidence, Overall Accuracy, and Average Accuracy values. Basically, the model with the highest values across these three metrics is the good one to use. As you can see, the winner here is the CLAS_DT_3_6 decision tree model.

Next, let’s see what input data columns are used as predictors for the decision tree model. You can find that information in the Model Viewer below. Surprisingly, it only uses a few columns for the prediction. These columns will be our input data requirement for the scoring function, the rest of the input columns can be ignored.

Score the Model

Let’s complete the workflow with an Apply node, from which we will generate the scoring SQL statement to be used for the Web Service. Here we reuse the INSUR_CUST_LTV_SAMPLE data as input data to the Apply node, and select only the required columns as found in the previous step. Also, in the Class Build node we deselect the other models as output in the Property Inspector (Models tab), so that only decision tree model will be used for the Apply node. The generated scoring SQL statement will use only the decision tree model to score against the limited set of input columns.

Generate SQL Statement for Scoring

After the workflow is run successfully, we can generate the scoring SQL statement via the “Save SQL” context menu off the Apply node as shown below.

Here is the generated SQL statement:

/* SQL Deployed by Oracle SQL Developer from Node "Apply", Workflow "workflow score", Project "project", Connection "conn_12c" on Mar 16, 2014 */
ALTER SESSION set "_optimizer_reuse_cost_annotations"=false;
/* Start of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
"N$10013" as (select /*+ inline */ "INSUR_CUST_LTV_SAMPLE"."BANK_FUNDS",
/* End of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
/* Start of sql for node: Apply */
"N$10011" as (SELECT /*+ inline */
FROM "N$10013" )
/* End of sql for node: Apply */
select * from "N$10011";

We need to modify the first SELECT SQL statement to change the data source from a database table to a record that can be constructed on the fly, which is crucial for real-time scoring. The bind variables (e.g. :funds) are used; these variables will be replaced with actual data (passed in by the Web Service request) when the SQL statement is executed.

/* SQL Deployed by Oracle SQL Developer from Node "Apply", Workflow "workflow score", Project "project", Connection "conn_12c" on Mar 16, 2014 */
/* Start of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
"N$10013" as (select /*+ inline */
:funds "BANK_FUNDS",
:checking "CHECKING_AMOUNT",
:atm "N_TRANS_ATM",
from DUAL
/* End of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
/* Start of sql for node: Apply */
"N$10011" as (SELECT /*+ inline */
FROM "N$10013" )
/* End of sql for node: Apply */
select * from "N$10011";

Create Scoring Web Service

Assume the Oracle APEX and Oracle REST Data Services have been properly installed and configured; we can proceed to create a RESTful web service for real-time scoring. The followings describe the steps to create the Web Service in APEX:

1. APEX Login

You can bring up the APEX login screen by pointing your browser to http://<host>:<port>/ords. Enter your Workspace name and account info to login. The Workspace should be based on the Data Miner DMUSER account for this demo to work.

2. Select SQL Workshop

Select the SQL Workshop icon to proceed.

3. Select RESTful Services

Select the RESTful Services to create the Web Service.

Click the “Create” button to continue.

4. Define Restful Services

Enter the following information to define the scoring Web Service in the RESTful Services Module form:

Name: buyinsurance

URI Prefix: score/

Status: Published

URI Template: buyinsurance?funds={funds}&checking={checking}&credit={credit}&atm={atm}&payments={payments}

Method: GET

Source Type: Query Format: CSV


/* SQL Deployed by Oracle SQL Developer from Node "Apply", Workflow "workflow score", Project "project", Connection "conn_11204" on Mar 16, 2014 */
/* Start of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
"N$10013" as (select /*+ inline */
:funds "BANK_FUNDS",
:checking "CHECKING_AMOUNT",
:atm "N_TRANS_ATM",
from DUAL
/* End of sql for node: INSUR_CUST_LTV_SAMPLE APPLY */
/* Start of sql for node: Apply */
"N$10011" as (SELECT /*+ inline */
FROM "N$10013" )
/* End of sql for node: Apply */
select * from "N$10011";

Note: JSON output format is supported.

Lastly, create the following parameters that are used to pass the data from the Web Service request (URI) to the bind variables used in the scoring SQL statement.

The final RESTful Services Module definition should look like the following. Make sure the “Requires Secure Access” is set to “No” (HTTPS secure request is not addressed in this demo).

Test the Scoring Web Service

Let’s create a simple web page using your favorite HTML editor (I use JDeveloper to create this web page). The page includes a form that is used to collect customer data, and then fires off the Web Service request upon submission to get a prediction and associated probability.

Here is the HTML source of the above Form:

<!DOCTYPE html>



<meta http-equiv="Content-Type" content="text/html; charset=UTF-8"/>





Determine if Customer will Buy Insurance


<form action="http://localhost:8080/ords/dataminer/score/buyinsurance" method="get">



<td>Bank Funds:</td>

<td><input type="text" name="funds"/></td>



<td>Checking Amount:</td>

<td><input type="text" name="checking"/></td>



<td>Credit Balance:</td>

<td><input type="text" name="credit"/></td>



<td>Number ATM Transactions:</td>

<td><input type="text" name="atm"/></td>



<td>Amount Auto Payments:</td>

<td><input type="text" name="payments"/></td>



<td colspan="2" align="right">

<input type="submit" value="Score"/>






When the Score button is pressed, the form sends a GET HTTP request to the web server with the collected form data as name-value parameters encoded in the URL.


Notice the {funds}, {checking}, {credit}, {atm}, {payments} will be replaced with actual data from the form. This URI matches the URI Template specified in the RESTful Services Module form above.

Let’s test out the scoring Web Service by entering some values in the form and hit the Score button to see the prediction.

The prediction along with its probability and cost is returned as shown below. Unfortunately, this customer is less likely to buy insurance.

Let’s change some values and see if we have any luck.

Bingo! This customer is more likely to buy insurance.


This blog shows how to deploy Data Miner generated scoring SQL as Web Service, which can be consumed by different systems on different platforms from anywhere. In theory, any SQL statement generated from the Data Miner node could potentially be made as Web Services. For example, we can have a Web Service that returns Model Details info, and this info can be consumed by some BI tool for application integration purpose.

Tuesday Nov 12, 2013

Oracle Big Data Learning Library

Click on LEARN BY PRODUCT to view all learning resources.

Oracle Big Data Essentials

Attend this Oracle University Course!

Using Oracle NoSQL Database

Attend this Oracle University class!

Oracle and Big Data on OTN

See the latest resource on OTN.

<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Wednesday Sep 04, 2013

Oracle Data Miner (Extension of SQL Developer 4.0) Integrate Oracle R Enterprise Mining Algorithms into workflow using the SQL Query node

I posted a new white paper authored by Denny Wong, Principal Member of Technical Staff, User Interfaces and Components, Oracle Data Mining Technologies.  You can access the white paper here and the companion files here.  Here is an excerpt:

Oracle Data Miner (Extension of SQL Developer 4.0) 

Integrate Oracle R Enterprise Mining Algorithms into workflow using the SQL Query node

Oracle R Enterprise (ORE), a component of the Oracle Advanced Analytics Option, makes the open source R statistical programming language and environment ready for the enterprise and big data. Designed for problems involving large amounts of data, Oracle R Enterprise integrates R with the Oracle Database. R users can develop, refine and deploy R scripts that leverage the parallelism and scalability of the database to perform predictive analytics and data analysis.

Oracle Data Miner (ODMr) offers a comprehensive set of in-database algorithms for performing a variety of mining tasks, such as classification, regression, anomaly detection, feature extraction, clustering, and market basket analysis. One of the important capabilities of the new SQL Query node in Data Miner 4.0 is a simplified interface for integrating R scripts registered with the database. This provides the support necessary for R Developers to provide useful mining scripts for use by data analysts. This synergy provides many additional benefits as noted below.

· R developers can further extend ODMr mining capabilities by incorporating the extensive R mining algorithms from the open source CRAN packages or leveraging any user developed custom R algorithms via SQL interfaces provided by ORE.

· Since this SQL Query node can be part of a workflow process, R scripts can leverage functionalities provided by other workflow nodes which can simplify the overall effort of integrating R capabilities within the database.

· R mining capabilities can be included in the workflow deployment scripts produced by the new sql script generation feature. So the ability of deploy R functionality within the context of an Data Miner workflow is easily accomplished.

· Data and processing are secured and controlled by the Oracle Database. This alleviates a lot of risk that are incurred by other providers, when users have to export data out of the database in order to perform advanced analytics.

Oracle Advanced Analytics saves analysts, developers, database administrators and management the headache of trying to integrate R and database analytics. Instead, users can quickly gain the benefit of new R analytics and spend their time and effort on developing business solutions instead of building homegrown analytical platforms.

This paper should be very useful to R developers wishing to better understand how to leverage imbedding R Scripts for use by Data Analysts.  Analysts will also find the paper useful to see how R features can be surfaced for their use in Data Miner. The specific use case covered demonstrates how to use the SQL Query node to integrate R glm and rpart regression model build, test, and score operations into the workflow along with nodes that perform data preparation and residual plot graphing. However, the integration process described here can easily be adapted to integrate other R operations like statistical data analysis and advanced graphing to expand ODMr functionalities.

<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Monday Jul 15, 2013

Oracle Data Miner GUI, part of SQL Developer 4.0 Early Adopter 1 is now available for download on OTN

The NEW Oracle Data Miner GUI, part of SQL Developer 4.0 Early Adopter 1 is now available for download on OTN.  See link to SQL Developer 4.0 EA1.   

The Oracle Data Miner 4.0 New Features are applicable to Oracle Database 11g Release 2 and Oracle Database Release 12c:  See Oracle Data Miner Extension to SQL Developer 4.0 Release Notes for EA1 for additional information  

· Workflow SQL Script Deployment

o Generates SQL scripts to support full deployment of workflow contents

· SQL Query Node

o Integrate SQL queries to transform data or provide a new data source

o Supports the running of R Language Scripts and viewing of R generated data and graphics

· Graph Node

o Generate Line, Scatter, Bar, Histogram and Box Plots

· Model Build Node Improvements

o Node level data usage specification applied to underlying models

o Node level text specifications to govern text transformations

o Displays heuristic rules responsible for excluding predictor columns

o Ability to control the amount of Classification and Regression test results generated

· View Data

o Ability to drill in to view custom objects and nested tables

These new Oracle Data Miner GUI capabilities expose Oracle Database 12c and Oracle Advanced Analytics/Data Mining Release 1 features:

· Predictive Query Nodes

o Predictive results without the need to build models using Analytical Queries

o Refined predictions based on data partitions

· Clustering Node New Algorithm

o Added Expectation Maximization algorithm

· Feature Extraction Node New Algorithms

o Added Singular Value Decomposition and Principal Component Analysis algorithms

· Text Mining Enhancements

o Text transformations integrated as part of Model's Automatic Data Preparation

o Ability to import Build Text node specifications into a Model Build node

· Prediction Result Explanations

o Scoring details that explain predictive result

· Generalized Linear Model New Algorithm Settings

o New algorithm settings provide feature selection and generation

See OAA on OTN pages for more information on Oracle Advanced Analytics.

<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Wednesday May 08, 2013

Oracle Advanced Analytics and Data Mining at the Movies on YouTube - Updated July 12, 201625

Updated July 25, 2016

Periodically, I've recorded a demonstration and/or presentation on Oracle Advanced Analytics and Data Mining and have posted them on YouTube.

Here are links to some of more recent YouTube postings--sort of an Oracle Advanced Analytics and Data Mining at the Movies experience.

  1. Mining Structured and Unstructured Data using Oracle Advanced Analytics (slides)  - Watch on YouTube

  2. New Big Data Analyics using Oracle Advanced Analytics12c and Big Data SQL  - Watch on YouTube
  3. New Oracle Academy Webcast:  Ask the Oracle Experts Fraud &  Anomaly Detection using Oracle Advanced Analytics 12c & Big Data SQL - Watch YouTube
  4. New - Oracle Academy Webcast:  Ask the Oracle Experts Big Data Analytics with Oracle Advanced Analytics - Watch YouTube
  5. Oracle Data Miner and Oracle R Enterprise Integration via SQL Query node - Watch Demo
  6. Oracle Data Miner 4.0 (SQL Developer 4.0 Extension) New Features - Watch Demo
  7. Oracle Business Intelligence Enterprise Edition (OBIEE) SampleAppls Demo featuring integration with Oracle Advanced Analytics/Data Mining
  8. Oracle Big Data Analytics Demo mining remote sensor data from HVACs for better customer service 
  9. In-Database Data Mining for Retail Market Basket Analysis Using Oracle Advanced Analytics
  10. In-Database Data Mining Using Oracle Advanced Analytics for Classification using Insurance Use Case
  11. Fraud and Anomaly Detection using Oracle Advanced Analytics Part 1 Concepts
  12. Fraud and Anomaly Detection using Oracle Advanced Analytics Part 2 Demo
  13. Overview Presentation and Demonstration of Oracle Advanced Analytics Database Option

So.... grab your popcorn and a comfortable chair.  Hope you enjoy!


Oracle Advanced Analytics at the Movies

Friday Feb 22, 2013

Take a FREE Test Drive with Oracle Advanced Analytics/Data Mining on the Amazon Cloud

I wanted to highlight a wonderful new resource provided by our partner Vlamis Software.  Extremely easy!  Fill out the form, wait a few minutes for the Amazon Cloud instance to start up and them BAM!  You can login and start using the Oracle Advanced Analytics Oracle Data Miner work flow GUI.  Demo data and online Oracle by Example Learning Tutorials are also provided to ensure your data mining test drive is a positive one,  Enjoy!! 

Test Drive -- Powered by Amazon AWS

We have partnered with Amazon Web Services to provide to you, free of charge, the opportunity to work, hands-on, with the latest of Oracle's Business Intelligence offerings. By signing up to one of the labs below, Amazon's Elastic Cloud Computer (EC2) environment will generate a complete server for you to work with.

These hands on labs are working with the actual Oracle software running on the Amazon Web Services EC2 environment. They each take approximately 2 hours to work through and will give you hands-on experience with the software and a tour of the features. Your EC2 environment will be available for you for 5 hours, at which time it will self-terminate. If, after registration, you need additional time or need further instructions, simply reply to the registration email and we would be glad to help you.

Data Mining

This test drive walks through some basic exercises in doing predictive analytics within an Oracle 11g Database instance using the Oracle Data Miner extension for Oracle SQL Developer. You use a drag-and-drop "workflow" interface to build a data mining model that predicts the likelihood of purchase for a set of prospects. Oracle Data Mining is ideal for automatically finding patterns, understanding relationships, and making predictions in large data sets.

<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Tuesday Jan 01, 2013

Turkcell Combats Pre-Paid Calling Card Fraud Using In-Database Oracle Advanced Analytics

Turkcell İletişim Hizmetleri A.S. Successfully Combats Communications Fraud with Advanced In-Database Analytics

[Original link available on]

Turkcell İletişim Hizmetleri A.Ş. is a leading provider of mobile communications in Turkey with more than 34 million subscribers. Established in 1994, Turkcell created the first global system for a mobile communications (GSM) network in Turkey. It was the first Turkish company listed on the New York Stock Exchange.

Communications fraud, or the  use of telecommunications products or services without intention to pay, is a major issue for the organization. The practice is fostered by prepaid card usage, which is growing rapidly. Anonymous network-branded prepaid cards are a tempting vehicle for money launderers, particularly since these cards can be used as cash vehicles—for example, to withdraw cash at ATMs. It is estimated that prepaid card fraud represents an average loss of US$5 per US$10,000 in transactions. For a communications company with billions of transactions, this could result in millions of dollars lost through fraud every year.

Consequently, Turkcell wanted to combat communications fraud and money laundering by introducing advanced analytical solutions to monitor key parameters of prepaid card usage and issue alerts or block fraudulent activity. This type of fraud prevention would require extremely fast analysis of the company’s one petabyte of uncompressed customer data to identify patterns and relationships, build predictive models, and apply those models to even larger data volumes to make accurate fraud predictions.

To achieve this, Turkcell deployed Oracle Exadata Database Machine X2-2 HC Full Rack, so that data analysts can build predictive antifraud models inside the Oracle Database and deploy them into Oracle Exadata for scoring, using Oracle Data Mining, a component of Oracle Advanced Analytics, leveraging Oracle Database11g technology. This enabled the company to create predictive antifraud models faster than with any other machine, as models can be built using search and query language (SQL) inside the database, and Oracle Exadata can access raw data without summarized tables, thereby achieving extremely fast analyses.


A word from Turkcell İletişim Hizmetleri A.Ş.

“Turkcell manages 100 terabytes of compressed data—or one petabyte of uncompressed raw data—on Oracle Exadata. With Oracle Data Mining, a component of the Oracle Advanced Analytics Option, we can analyze large volumes of customer data and call-data records easier and faster than with any other tool and rapidly detect and combat fraudulent phone use.” – Hasan Tonguç Yılmaz, Manager, Turkcell İletişim Hizmetleri A.Ş.

  • Combat communications fraud and money laundering by introducing advanced analytical solutions to monitor prepaid card usage and alert or block suspicious activity
  • Monitor numerous parameters for up to 10 billion daily call-data records and value-added service logs, including the number of accounts and cards per customer, number of card loads per day, number of account loads over time, and number of account loads on a subscriber identity module card at the same location
  • Enable extremely fast sifting through huge data volumes to identify patterns and relationships, build predictive antifraud models, and apply those models to even larger data volumes to make accurate fraud predictions
  • Detect fraud patterns as soon as possible and enable quick response to minimize the negative financial impact


Oracle Product and Services

  • Used Oracle Exadata Database Machine X2-2 HC Full Rack to create predictive antifraud models more quickly than with previous solutions by accessing raw data without summarized tables and providing unmatched query speed, which optimizes and shortens the project design phases for creating predictive antifraud models
  • Leveraged SQL for the preparation and transformation of one petabyte of uncompressed raw communications data, using Oracle Data Mining, a feature of Oracle Advanced Analytics to increase the performance of predictive antifraud models
  • Deployed Oracle Data Mining models on Oracle Exadata to identify actionable information in less time than traditional methods—which would require moving large volumes of customer data to a third-party analytics software—and achieve an average gain of four hours and more, taking into consideration the absence of any system crash (as occurred in the previous environment) during data import
  • Achieved extreme data analysis speed with in-database analytics performed inside Oracle Exadata, through a row-wise information search—including day, time, and duration of calls, as well as number of credit recharges on the same day or at the same location—and query language functions that enabled analysts to detect fraud patterns almost immediately
  • Implemented a future-proof solution that could support rapidly growing data volumes that tend to double each year with Oracle Exadata’s massively scalable data warehouse performance

Why Oracle

“We selected Oracle because in-database mining to support antifraud efforts will be a major focus for Turkcell in the future. With Oracle Exadata Database Machine and the analytics capabilities of Oracle Advanced Analytics, we can complete antifraud analysis for large amounts of call-data records in just a few hours. Further, we can scale the solution as needed to support rapid communications data growth,” said Hasan Tonguç Yılmaz, datawarehouse/data mining developer, Turkcell Teknoloji Araştırma ve Geliştirme A.Ş.


Oracle Partner: Turkcell Teknoloji Araştırma ve Geliştirme A.Ş.

All development and test processes were performed by Turkcell Teknoloji. The company also made significant contributions to the configuration of numerous technical analyses which are carried out regularly by Turkcell İletişim Hizmetleri's antifraud specialists.


<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Friday Jun 08, 2012

New Oracle Advanced Analytics presentation

I recently updated my presentation on Oracle's new Advanced Analytics Option which bundles Oracle Data Mining with Oracle R Enterprise for maximum depth and breadth of data mining, statistics and advanced analytic functions from Oracle.  See New Oracle Advanced Analytics presentation.  

Tuesday May 29, 2012

Fraud and Anomaly Detection using Oracle Data Mining YouTube-like Video

I've created and recorded another YouTube-like presentation and "live" demos of Oracle Advanced Analytics Option, this time focusing on Fraud and Anomaly Detection using Oracle Data Mining.  [Note:  It is a large MP4 file that will open and play in place.  The sound quality is weak so you may need to turn up the volume.]

Data is your most valuable asset. It represents the entire history of your organization and its interactions with your customers.  Predictive analytics leverages data to discover patterns, relationships and to help you even make informed predictions.   Oracle Data Mining (ODM) automatically discovers relationships hidden in data.  Predictive models and insights discovered with ODM address business problems such as:  predicting customer behavior, detecting fraud, analyzing market baskets, profiling and loyalty.  Oracle Data Mining, part of the Oracle Advanced Analytics (OAA) Option to the Oracle Database EE, embeds 12 high performance data mining algorithms in the SQL kernel of the Oracle Database. This eliminates data movement, delivers scalability and maintains security. 

But, how do you find these very important needles or possibly fraudulent transactions and huge haystacks of data? Oracle Data Mining’s 1 Class Support Vector Machine algorithm is specifically designed to identify rare or anomalous records.  Oracle Data Mining's 1-Class SVM anomaly detection algorithm trains on what it believes to be considered “normal” records, build a descriptive and predictive model which can then be used to flags records that, on a multi-dimensional basis, appear to not fit in--or be different.  Combined with clustering techniques to sort transactions into more homogeneous sub-populations for more focused anomaly detection analysis and Oracle Business Intelligence, Enterprise Applications and/or real-time environments to "deploy" fraud detection, Oracle Data Mining delivers a powerful advanced analytical platform for solving important problems.  With OAA/ODM you can find suspicious expense report submissions, flag non-compliant tax submissions, fight fraud in healthcare claims and save huge amounts of money in fraudulent claims  and abuse.  

This presentation and several brief demos will show Oracle Data Mining's fraud and anomaly detection capabilities.  

<script type="text/javascript"> var _gaq = _gaq || []; _gaq.push(['_setAccount', 'UA-46756583-1']); _gaq.push(['_trackPageview']); (function() { var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + ''; var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); })(); </script>

Thursday May 10, 2012

Oracle Virtual SQL Developer Days DB May 15th - Session #3: 1Hr. Predictive Analytics and Data Mining Made Easy!


Oracle Data Mining's SQL Developer based ODM'r GUI + ODM is being featured in this upcoming Virtual SQL Developer Day online event next Tuesday, May 15th.  Several thousand people have already registered and registration is still growing.  We recorded and uploaded presentations/demos and then anyone can view them "on demand", but at the specified date/time per the SQL DD event agenda.  Anyone can also download a complete 11gR2 Database w/ SQL Developer 3.1 & Oracle Data Miner GUI extension VM installation for the Hands-on Labs and follow our 4 ODM Oracle by Examples e-training.  We moderators monitor the online chat and answer questions. 
Session #3: 1Hr. Predictive Analytics and Data Mining Made Easy!
Oracle Data Mining, a component of the Oracle Advanced Analytics database option, embeds powerful data mining algorithms in the SQL kernel of the Oracle Database for problems such as customer churn, predicting customer behavior, up-sell and cross-sell, detecting fraud, market basket analysis (e.g. beer & diapers), customer profiling and customer loyalty. Oracle Data Miner, SQL Developer 3.1 extension, provides data analysts a “workflow” paradigm to build analytical methodologies to explore data and build, evaluate and apply data mining models—all while keeping the data inside the Oracle Database. This workshop will teach the student the basics of getting started using Oracle Data Mining.
We're also included in the June 7th physical event in NYC and future virtual and physical events.  Great event(s) and great "viz" for OAA/ODM.


Oracle Data Mining Virtual Classes Scheduled

Two Oracle Data Mining Virtual Classes are now scheduled.  Register for a course in 2 easy steps.

Step 1: Select your Live Virtual Class options


Live Virtual Class
Course ID: D76362GC10
Course Title: Oracle Database 11g: Data Mining Techniques
Duration: 2 Days
Price: US$ 1,300 Dollars

Step 2: Select the date and location of your Live Virtual Class

Please select a location below then click on the Add to Cart button


Location  Duration Class Date Class Start Time Class End Time Course Materials Instruction Language Seats Audience Employees
Online 2 Days 09-Aug-2012 04:00 AM EDT 12:00 PM EDT English English Available Public Employees
Online 2 Days 18-Oct-2012 04:00 AM EDT 12:00 PM EDT English English Available Public Employees

100% Student Satisfaction: Oracle's 100% Student Satisfaction program applies to those publicly scheduled and publicly available Oracle University Instructor Led Training classes that are identified as part of the 100% Student Satisfaction program on the website at the time the class is purchased. Oracle will permit unsatisfied students to retake the class, subject to terms and conditions. Customers are not entitled to a refund. For more information and additional terms, conditions and restrictions that apply, click here

Everything about Oracle Data Mining, a component of the Oracle Advanced Analytics Option - News, Technical Information, Opinions, Tips & Tricks. All in One Place


« July 2016