There is multiple solution to get started on the WebLab platform. However one should note few things first:
- WebLab is a platform developed through research projects and not meant to be use as it for an operational application ;
- WebLab is an integration platform dedicated to unstructured document processing and retrieval (and more...) ;
- WebLab is a server application and even if the demo is packaged to be easily deployed and installed on a single machine for test purpose, most applications based on WebLab are multi-servers application ;
This being said, here are the possible way to get in touch with the WebLab.
Test the demo
The easiest way is to download the WebLab demonstration package available on the OW2 forge. It's a simple Zip file and once extracted you should be able to easily install and launch a simple demo application based on WebLab. The scope of the demonstration is really "reduced to the max" in order to be able to run on a single machine. So it can:
- get documents from a local folder ;
- normalize and extract text content from standard desktop text files ;
- automatically extract information about named entities in the text in English ;
- index the results ;
- provide access to the processed documents through a simple text search engine and simple document visualization tool.
On the hardware side, the demo is configured to run on an advanced computer (well not so advanced but not an "office" configuration):
- minimal specification : dual core with 4Go of memory and 400Mo of disk space
- recommended specification : quad core with 8Go and 500Mo of disk space (and fast hard disk are of course recommended)
Then on the capability side this demonstration include less than 10 documents but it has been tested on more than 10000. So you can try with your own documents.