The Oldest Big Data Problem: Parsing Human Language
Posted
by dan.mcclary
on Oracle Blogs
See other posts from Oracle Blogs
or by dan.mcclary
Published on Mon, 15 Oct 2012 17:54:43 +0000
Indexed on
2012/10/15
21:44 UTC
Read the original article
Hit count: 340
/Oracle/Big Data
There's a new whitepaper up on Oracle Technology Network which details the use of Digital Reasoning Systems' Synthesys software on Oracle Big Data Appliance. Digital Reasoning's approach is inherently "big data friendly," as it leverages multiple components of the Hadoop ecosystem. Moreover, the paper addresses the oldest big data problem of them all: extracting knowledge from human text.
You can find the paper here.
From the Executive Summary:
There is a wealth of information to be extracted from natural language, but that extraction is challenging. The volume of human language we generate constitutes a natural Big Data problem, while its complexity and nuance requires a particular expertise to model and mine. In this paper we illustrate the impressive combination of Oracle Big Data Appliance and Digital Reasoning Synthesys software. The combination of Synthesys and Big Data Appliance makes it possible to analyze tens of millions of documents in a matter of hours. Moreover, this powerful combination achieves four times greater throughput than conducting the equivalent analysis on a much larger cloud-deployed Hadoop cluster.
© Oracle Blogs or respective owner