On Monday, May 20th, the Semantic Data Factory goes live. After many months of hard work together with our partners from Recognos Financial, DTCC and Delta Data, the data factory will provide a long-awaited industry utility for US mutual funds. The data factory has a “data assembly line” that combines an automatic NLP-based data extraction process with data correction and QA functionality, in which data specialists correct or enhance the data that was not extracted correctly. The main advantages of our method are the high efficiency of the process, the continuous improvement of the extraction process based on accumulated experience, and the maintenance of the provenance metadata associated with the result set. The system builds on years of experimentation and prototype building in our Romanian office, a process that involved data specialists, semantic NLP experts, distributed system developers and others. Together with our partners from Recognos Financial in New York, we believe that the future of the financial data industry is here, and we are preparing new “data production lines” for other complex financial data sets that need to be extracted from heterogeneous formatted and unformatted data sources.
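To make the “data assembly line” idea concrete, here is a minimal sketch in Python. It is not the actual Semantic Data Factory code: the regular expression stands in for the NLP extraction step, and the names `ExtractedValue`, `extract_expense_ratio`, `needs_review` and `apply_correction`, the `expense_ratio` field and the 0.8 threshold are all assumptions made for illustration. It only shows the three ideas described above: automatic extraction, routing of doubtful values to a data specialist, and a provenance trail attached to every value.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class ExtractedValue:
    """One extracted data point plus the history of how it was produced."""
    field_name: str
    value: Optional[str]
    confidence: float
    provenance: List[dict] = field(default_factory=list)


def extract_expense_ratio(doc_id: str, text: str) -> ExtractedValue:
    """Hypothetical stand-in for the NLP extraction step (a single regex)."""
    pattern = r"expense ratio[^0-9]*([0-9]+(?:\.[0-9]+)?)\s*%"
    match = re.search(pattern, text, re.IGNORECASE)
    step = {"step": "auto_extraction", "source": doc_id,
            "span": match.span(1) if match else None}
    if match:
        return ExtractedValue("expense_ratio", match.group(1) + "%", 0.9, [step])
    return ExtractedValue("expense_ratio", None, 0.0, [step])


def needs_review(item: ExtractedValue, threshold: float = 0.8) -> bool:
    """Route missing or low-confidence values to the QA queue."""
    return item.value is None or item.confidence < threshold


def apply_correction(item: ExtractedValue, corrected: str, reviewer: str) -> ExtractedValue:
    """Record a specialist's correction without losing the earlier history."""
    item.provenance.append({"step": "manual_correction",
                            "reviewer": reviewer,
                            "previous": item.value})
    item.value = corrected
    item.confidence = 1.0
    return item
```

In a real system the corrections collected this way could also be fed back to improve the extractor, which is what continuous improvement based on accumulated experience would amount to.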
Archive for the ‘Events’
In October 2012, Recognos Romania joined the CLUJ IT initiative as one of the 31 founding members. CLUJ IT is a cluster association of organizations from the Cluj-Napoca area, aiming to enhance innovation and competitiveness of the Romanian IT sector. It comprises 23 privately owned companies, 2 universities (Babes-Bolyai University and the Technical University of Cluj-Napoca) and 6 public institutions.
The main aim of this initiative is to build an ecosystem for the development and commercialization of innovative, value-adding software services and products through strong cooperation between the cluster members, public-private partnership and fostering of R&D and innovation.
The association was formally constituted in October 2012 and officially launched on the 14th of November in Bucharest.
Recognos will participate in the Semantic WEB Conference in Washington DC (November 29 – December 1st). The Recognos presentation “Application of Semantic Technology in Document Management” will take place on Wednesday, November 30, 2011, 01:50 PM – 02:40 PM in Ballroom D.
The 2011 Semantic Technology Conference (#SemTech) will be held at the Hilton Union Square in downtown San Francisco on June 5-9, 2011. Now in its sixth year, SemTech 2011 is the world’s largest educational conference for the community of executives, technologists, researchers, investors and customers involved with semantic technologies.
SemTech 2011 features five days of presentations, panels, tutorials, announcements, new company/product launches, and conversations. It’s a place for new learning, professional networking, and business development.
Semantic WEB Summit East
Boston, November 16-17, 2010
The Semantic WEB Summit East was a small gathering of people interested mostly in the applications of the Semantic WEB. The main trend is that the number of real implementations is increasing, and large organizations like the US Department of Defense, Best Buy, Overstock.com, BBC, Merck, etc. are increasingly interested in the new technology. The main applications are in the following areas:
1) Dealing with change of data – an alternative to data warehouses
2) Business intelligence solutions
3) Intelligent search
4) Unstructured document processing
5) Context determination from documents
This document summarizes the presentations, with references to related information sources.
- Lee Feigenbaum – VP of technology www.cambridgesemantics.com
Lee presented the use of Excel as the UI for semantic applications and data integration. Cambridge Semantics has an interesting tool, and he showed applications in life sciences, i.e. the pharmaceutical industry.
He presented the differences between Semantic Technology and Semantic WEB technology. Semantic WEB technology is a part of Semantic Technology and refers to technologies like RDF, RDFS, SPARQL and RDFa applied to the information exposed on the Internet. Semantic Technology is a broader term and refers to data mining, unstructured data processing, semantic search, heterogeneous data integration, knowledge processing, etc.
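For readers new to these terms, the core of RDF and SPARQL can be illustrated in a few lines of plain Python. This is only a toy sketch, not something shown at the summit: RDF describes data as (subject, predicate, object) triples, and a SPARQL query is essentially a triple pattern in which some positions are variables. The `ex:` names and the helper functions `match` and `subjects_of_type` are made up for illustration; a real application would use an RDF store and a SPARQL endpoint.

```python
from typing import List, Optional, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)

# A tiny in-memory "graph" with hypothetical example data.
TRIPLES: List[Triple] = [
    ("ex:FundA", "rdf:type", "ex:MutualFund"),
    ("ex:FundA", "ex:managedBy", "ex:AcmeAdvisors"),
    ("ex:FundB", "rdf:type", "ex:MutualFund"),
    ("ex:AcmeAdvisors", "rdf:type", "ex:Company"),
]


def match(triples: List[Triple],
          s: Optional[str] = None,
          p: Optional[str] = None,
          o: Optional[str] = None) -> List[Triple]:
    """Return the triples matching a pattern; None is a wildcard (like a SPARQL variable)."""
    return [t for t in triples
            if (s is None or t[0] == s)
            and (p is None or t[1] == p)
            and (o is None or t[2] == o)]


def subjects_of_type(triples: List[Triple], rdf_class: str) -> List[str]:
    """Roughly: SELECT ?s WHERE { ?s rdf:type <rdf_class> }"""
    return sorted(t[0] for t in match(triples, p="rdf:type", o=rdf_class))
```

For example, `subjects_of_type(TRIPLES, "ex:MutualFund")` lists the two funds, and `match(TRIPLES, s="ex:FundA")` returns everything known about the first one.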
- Michael Lang, CEO Revelytix
Revelytix is a company that works a lot with the US Military. They provide tools for:
- collaboration in building ontologies by communities: a wiki-type, open source system that can be downloaded (www.knoodle.com). We can get support from them if we want to use it, and we can invite them to a meetup to show the system remotely.
Michael presented an ontology-based architecture for building semantic applications. They use three ontologies: a Domain Ontology, a Mapping Ontology (built using the open source data mapping tool D2RQ) and a Metadata Ontology. He mentioned the migration from the EIW (Enterprise Information Warehouse) to the Enterprise Information Web. The US Department of Defense is using this model to define its future architecture.
- Duane Degler – How to design the UI for Semantic Applications.
He mentioned a very interesting project from MIT, that has a lot of Open Source widgets used to visualize large amounts of data:
He mentioned an interesting new browser interface that can be used, named Freebase Parallax. http://www.freebase.com/labs/parallax/
Drupal 7 was also mentioned as one of the best environments for developing semantic social network applications: it supports SPARQL endpoints and the creation of semantic metadata for postings.
- YY Lee – COO FirstRain – http://www.firstrain.com/
YY is the brain behind their system that data mines multiple sources of data in order to determine trends in the market.
- Marco Neumann – Lotico – www.lotico.com
Marco is the initiator of the worldwide Semantic Meetup trend. He collects the data from these meetups and exposes it through an RDF data API.
I will work with them to include the Cluj meetup in the Lotico group. Marco mentioned a new platform that he uses. It is a news aggregator that is worth checking out:
- Open Amplify – www.openamplify.com
Text Classification tool that can be used for market Sentiment analysis. The new product is named Ampliverse and is used to extract taxonomies from text.
(It is similar to the Open Calais platform – http://www.opencalais.com/)
- Making the case for Semantic Technology in Business – panel
Scott Brinker – www.chiefmartec.com – his study about marketing spending is worth reading
Martin Hepp – CEO of Hepp Research GmbH – former STI member – he created GoodRelations, the common WEB vocabulary for eCommerce used by Best Buy, Overstock.com and O'Reilly Media.
The site presents the concept and how to use it. GoodRelations is about creating a common vocabulary for describing products, including their semantic description and display aspects, so that it can be used by retailers who sell those products.
The idea is to make the product as findable as possible for search engines.
It is used by embedding RDFa markup in WEB pages, which makes them findable by semantic search engines. I believe that this will be successful in the future because it is in everybody's interest (producer, marketer and consumer) to find the product in the simplest way.
Jay Myers – www.bestbuy.com
Jay is the promoter of Semantic Technologies at www.bestbuy.com. He presented how the large retailer is implementing semantic technologies and described its strategic formula: integration with externally facing open data (tagged with RDFa) and internal Linked Data, exposed through SPARQL endpoints that are used to query internal dynamic data sources live. To see how important analyzing this data is for Best Buy, check out this news:
- Semantic technology in the US Department of Defense
Dennis Wisnosky is the CTO and Chief Architect for the Department of Defense. The US DOD is the world's largest organization, with almost 2 million employees all over the world. The main idea that Dennis is promoting is the replacement of the data warehouse approach with semantics for data integration. This is important for other organizations because it shows that the technology is mature enough to be integrated into such a strategic program for the US DOD. Another very interesting fact is the use of Open Source in their strategic architecture.
The common vocabulary used in this can be seen at:
- Mathew Petrillo – Ontotext – Managing Unstructured Content
Ontotext is a Bulgarian company that participates in multiple European Projects and develops tools and platforms for Semantic technology.
Some of the used platforms / tools are:
- LarKC – http://www.larkc.eu/ – the Large Knowledge Collider
- OWLIM 3.4 – http://semanticweb.org/wiki/OWLIM – RDF management system (competes with Franz)
- Ontotext created a JENA adapter kit for OWLIM to solve scalability issues
- Ontotext uses the Joseki server for SPARQL – http://www.joseki.org/
- Uses TopQuadrant for Ontology Management
- Fluid Ops – for hosting
- SaltLux.com – Korea
- BPEng – http://www.bpeng.com/ – Italy
- Profium – Finland – http://www.profium.com/ – Semantic Content Management
- Uses GATE
- Expert System announced the spin-off of Admantx (http://www.admantx.com/), a vertical built on COGITO for semantic advertising.
- Rob Gonzales – Cambridge Semantics
He presented Anzo (http://semanticweb.com/cambridge-semantics-launches-semantic-platform-and-tools-for-non-technical-users-includes-excel-plug-in-and-web-front-end_b13239), a semantic platform. They applied the platform (which uses Excel as its UI) in applications for biotech companies. The applications were implemented in three areas of these companies:
a) Assay management in drug research
b) Manufacturing Quality Control
c) Salesforce optimization
They announced a partnership with the CRAY supercomputer company.
- Rachel Lovinger – Razorfish
Rachel presented semantic applications in the media and recommended downloading a very interesting document:
They are using Dublin Core (http://dublincore.org/documents/usageguide/) for tagging, FOAF for microformats, GoodRelations as the common vocabulary, and RDFa, microformats and HTML5. She mentioned how active the BBC is in this space.
Technologies: Open Publish (Drupal and Open Calais).
You can see her presentation at:
- Stephen Wolfram – www.WolframAlpha.com
He is an amazing guy. He explained how WolframAlpha works. Stephen worked for 25 years on the implementation of the Mathematica language. He is an expert in Complexity Theory.
Here is how Wolfram Alpha works (try it out, it is amazing!):
- They collected a very large amount of data from authoritative sources (not folksonomies, not from the web). They made this data computable for the Mathematica engine.
- Everything that is computed needs to be translated into multiple lines of Mathematica code.
- How does the interaction with WolframAlpha work? It uses natural language. For each question a disambiguation process takes place. If the system cannot interpret the question in a single way, a set of options is offered and the user needs to pick the right one (check out the site by asking it questions). They also use folksonomies as additional tools for disambiguation.
- What is the answer? They bring in everything they know about the question: tables, graphs, maps, etc. It is amazing!
He used the following examples that you can try out:
“uncle’s brother son”
“oil production in the world”
“banana consumption Europe”
What is next for WolframAlpha:
- launch a new document format named Computable Document Format, used for creating interactive documents
- bringing Wolfram Alpha to the new PDAs – Droids
- combine Mathematica with Wolfram Alpha
He showed some amazing examples. For example, he took a picture with a camera, placed it on the desktop, dragged it into a sheet of Mathematica code, between the brackets, and entered:
EdgeDetect[ here was the actual picture ] – the system drew an image with the edges of the picture.
He also showed code like:
Plot sin x – the system drew the graph of the sin function
Add red frame – the system put the picture in a red frame
Add yellow background – the system colored the background yellow
He wants to make it possible for non-programmers to program in natural language. Amazing!
Recognos Financial, in cooperation with Expert System, will present at the New York Semantic WEB Meetup on June 10, 2010.
The first Semantic Meetup in Romania will meet on June 11, 2010 at the offices of Recognos Romania.
June 21st – 25th, 2010: Semantic Technology Conference at the Hilton Union Square, San Francisco, California
Now in its sixth year, SemTech 2010 is the foremost place to learn about the commercialization of Semantic Technologies.
The Recognos Group participated with papers at SemTech 2009 (June 14-18, San Jose, CA, USA) and KEPT 2009 (July 2-4, Cluj-Napoca, Romania).
The papers present the experience of the Recognos Group in developing commercial semantic applications. The Recognos Group consists of three companies: Recognos Inc., San Francisco, CA, USA, Recognos Romania Ltd., Cluj-Napoca, Romania and Recognos Financial LLP, New York, USA.
The papers that were presented are:
“Semantic Technologies in the Financial Services Industry”, Drew Warren – Recognos Financial, New York, USA; George Roth – Recognos Inc., San Francisco, CA, USA
“Integrate heterogeneous data using semantic technologies”, Robert Baban, Adrian Petrescu – Recognos Romania, Cluj Napoca, Romania
The Recognos Group participates in the IT Gartner Expo in Orlando, Florida, USA.