The Textual Data Warehouse
The value of a text based data warehouse is unquestioned. But how are text based data warehouses created?
This one day tutorial addresses what some of the major issues of building a text based data warehouse are and how those obstacles are over come. Some of the issues that are addressed include small subjects such as date standardization and textual numeric conversion. Other issues are much larger, such as addressing the issues of terminology and managing the logical subdivision of documents.
- what are the issues in building the textual based data warehouse
- terminology and logical subdivision of text
- inputting spreadsheets
- OCR
- defining delimiters
- combining textual data with classical structured data
For years corporate decisions have been made on the basis of the data found in transaction based systems. Transaction oriented data fits well with standard data base management systems because data base management systems structure data in a repetitive manner, where each occurrence of data has the same structure as each other occurrence of data in a table. But there is another viable and important source of data in the corporation. That source of data is the information found in the form of text. There are many forms of text in the corporation – emails, spreadsheets, contracts, warranties, medical and healthcare information, and so forth. Because text is not repetitive it does not fit easily and well with standard data base management systems.
But now there is textual ETL and the ability to build data bases and data warehouses that contain textual information. When textual data is able to be transformed so that the text fits inside a standard data base management system, whole new opportunities for analysis and decision making are created.
This one day lecture/workshop is about what is required to create the textual, unstructured data warehouse. The morning is lecture and the afternoon is a hands on workshop where data bases for the data warehouse will be built from text. All examples are in English.
Is this for me
This seminar/workshop is for people who are interested in the mechanics of taking text and producing an analytical data base from that text. Data architects, business people, project managers, technicians are all welcome in this class.
Reserveer daarom in uw agenda: 19 mei 2010!
Productinformatie
19 mei 2010The Textual Data WarehouseHoe worden tekstgebaseerde datawarehouses eigenlijk gecreëerd? U leert het in dit ééndaagse seminar.
Locatie: Intres, Hoevelaken
Home
Ook interessant