Apache Tika Toolkit for Content Detection and Analyisis
Article describes Apache Tika Toolkit as a one-stop shop for identifying, retrieving and parsing text and metadata from more than 1,200 file formats, such as HTML, PDF, images, OpenOffice, Microsoft Office and email.
Would you like to know more or ask a question?
Feel free to ask a question on Twitter, Facebook or using the contact form. You can also subscribe to websitePlus enews to get more hints, resources, how-to's and other informative information about social media and website strategy for small business.



