Text preprocessing for Trend Mining

To retrieve RDF triples from a text stream, the texts must first be preprocessed. Our preprocessing component uses the statistical parser from Stanford University and creates a timeline of the parsed texts.
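The timeline step can be illustrated with a minimal sketch. It assumes that each document carries a publication date and that the parser is passed in as a plain function; the names `build_timeline`, `whitespace_parse`, and the `(date, text)` record shape are hypothetical and not taken from the actual component. The stand-in parser below only splits on whitespace and is not the Stanford parser.

```python
from collections import defaultdict
from datetime import date
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical record shape: (publication date, raw text).
Document = Tuple[date, str]


def build_timeline(
    documents: Iterable[Document],
    parse: Callable[[str], object],
) -> Dict[date, List[object]]:
    """Parse each document and group the results by publication date."""
    buckets: Dict[date, List[object]] = defaultdict(list)
    for published, text in documents:
        buckets[published].append(parse(text))
    # Return a plain dict whose keys are in chronological order.
    return {day: buckets[day] for day in sorted(buckets)}


def whitespace_parse(text: str) -> List[str]:
    """Stand-in for a real parser (assumption): splits on whitespace only."""
    return text.split()


docs = [
    (date(2010, 7, 27), "Markets rallied today"),
    (date(2010, 7, 11), "Shares fell sharply"),
    (date(2010, 7, 27), "Analysts expect growth"),
]
timeline = build_timeline(docs, whitespace_parse)
```

Passing the parser in as a function keeps the grouping logic independent of any particular parsing library, so a real statistical parser could be substituted without changing `build_timeline`.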
Trend Mining corpus (in cooperation with neofonie GmbH)
Texts from our Trend Mining corpus are stored in two databases: finance and mafo. The finance database has 21 tables, each named after its document source (chats, information boards, etc.), and contains 276,587 documents. The mafo database has 27 tables, named in the same way, and contains 74,145 documents.
Chemisches Zentralblatt corpus (in cooperation with FIZ Chemie GmbH)
We develop approaches for the semantic preprocessing of chemical texts from the 19th century.
A further description will be available soon.

Latest News

2010-08-25 08:52

Corporate Semantic Web @ Xinnovations

The concepts developed over the past year have since been implemented as demonstrators and will be presented in short talks at the workshop. In addition, speakers from industry will present applications built on semantic technologies.

2010-07-27 23:38

6th Berlin Semantic Web Meetup at Xinnovations 2010

We invite you to the 6th Berlin Semantic Web Meetup on 14 September 2010 at 5 p.m.

2010-07-11 19:47

Tutorial: Event Processing Architectures at DEBS 2010

Corporate Semantic Web at DEBS 2010 in Cambridge, United Kingdom

Latest Publication

Linked Data Authoring for Non-Experts

Markus Luczak-Rösch and Ralf Heese, Linked Data on the Web Workshop at WWW2009, Madrid, Spain, April 20, 2009

© 2008 FU Berlin
This work has been partially supported by the InnoProfile-Corporate Semantic Web project funded by the German Federal Ministry of Education and Research (BMBF) and the BMBF Innovation Initiative for the New German Länder - Entrepreneurial Regions.