Programma 'The Textual Data Warehouse'
Morning Presentation

An Introduction To Unstructured Data

This Powerpoint presentation shows what the issues of unstructured data are, what the possibilities, and what opportunities there are. The attendee is introduced to textual ETL and the creation of the unstructured data base. The issues of integration are discussed amply.

Issues of Textual Integration
There are many issues in the integration of unstructured data and the transformation of unstructured data to an analytical relational data base.  Some of the issues are
- the issue of terminology
- the issue of logical sib divisions of data with a document
- the issues of clustering
- the issues of proximity
- the issues of filtering unnecessary data

These and many other issues are discussed in the context of reading in raw text and creating a viable analytical data base.

Afternoon Workshop

In the afternoon Textual ETL will be run producing a wide variety of data bases/data warehouses using many of the features of Textual ETL. The attendees will observe and participate in the transformation of text into a data base ready for analytic processing.
 
The workshop begins by examining some textual data. A strategy for capturing and organizing the text is discussed. Then the workshop continues with several types of processing that are done dynamically, under the purview of the attendees. Some of the types of processing that are done include:
- document metadata capture
- document fracturing
- named value indexing
- simple indexing
- semistructured indexing
- merged indexing.

Depending on the textual data that has been selected, some or all of these kinds of indexes will be chosen and created.
 
Many of the features of Textual ETL will be used during the workshop.

At the end of the workshop the attendees may process some documents that they have brought to the workshop to see what the processing of textual ETL can do.
 
Sponsor
Productinformatie
19 mei 2010
The Textual Data Warehouse

Hoe worden tekstgebaseerde datawarehouses eigenlijk gecreëerd? U leert het in dit ééndaagse seminar.

Locatie: Intres, Hoevelaken

Home
Programma
Spreker
Plaats
Kosten

Ook interessant
 BI-event 2010
Congres met internationale keynote sprekers Bill Inmon, Mark Greaves en Rick van der ...