Tika in Action

Tika in Action
Author :
Publisher : Simon and Schuster
Total Pages : 365
Release :
ISBN-10 : 9781638352631
ISBN-13 : 1638352631
Rating : 4/5 (631 Downloads)

Book Synopsis Tika in Action by : Jukka L. Zitting

Download or read book Tika in Action written by Jukka L. Zitting and published by Simon and Schuster. This book was released on 2011-11-30 with total page 365 pages. Available in PDF, EPUB and Kindle. Book excerpt: Summary Tika in Action is a hands-on guide to content mining with Apache Tika. The book's many examples and case studies offer real-world experience from domains ranging from search engines to digital asset management and scientific data processing. About the Technology Tika is an Apache toolkit that has built into it everything you and your app need to know about file formats. Using Tika, your applications can discover and extract content from digital documents in almost any format, including exotic ones. About this Book Tika in Action is the ultimate guide to content mining using Apache Tika. You'll learn how to pull usable information from otherwise inaccessible sources, including internet media and file archives. This example-rich book teaches you to build and extend applications based on real-world experience with search engines, digital asset management, and scientific data processing. In addition to architectural overviews, you'll find detailed chapters on features like metadata extraction, automatic language detection, and custom parser development. This book is written for developers who are new to both Scala and Lift and covers just enough Scala to get you started. Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book. What's Inside Crack MS Word, PDF, HTML, and ZIP Integrate with search engines, CMS, and other data sources Learn through experimentation Many examples This book requires no previous knowledge of Tika or text mining techniques. It assumes a working knowledge of Java. ========================================​== Table of Contents PART 1 GETTING STARTED The case for the digital Babel fish Getting started with Tika The information landscape PART 2 TIKA IN DETAIL Document type detection Content extraction Understanding metadata Language detection What's in a file? PART 3 INTEGRATION AND ADVANCED USE The big picture Tika and the Lucene search stack Extending Tika PART 4 CASE STUDIES Powering NASA science data systems Content management with Apache Jackrabbit Curating cancer research data with Tika The classic search engine example

Tika in Action Related Books

Tika in Action
Language: en
Pages: 365
Authors: Jukka L. Zitting
Categories: Computers
Type: BOOK - Published: 2011-11-30 - Publisher: Simon and Schuster

GET EBOOK

Summary Tika in Action is a hands-on guide to content mining with Apache Tika. The book's many examples and case studies offer real-world experience from domain
Lucene in Action
Language: en
Pages: 742
Authors: Otis Gospodnetic
Categories: Computers
Type: BOOK - Published: 2010-07-08 - Publisher: Simon and Schuster

GET EBOOK

When Lucene first hit the scene five years ago, it was nothing short ofamazing. By using this open-source, highly scalable, super-fast search engine,developers
Solr in Action
Language: en
Pages: 939
Authors: Timothy Potter
Categories: Computers
Type: BOOK - Published: 2014-03-25 - Publisher: Simon and Schuster

GET EBOOK

Summary Solr in Action is a comprehensive guide to implementing scalable search using Apache Solr. This clearly written book walks you through well-documented e
A Careful Revolution
Language: en
Pages: 128
Authors: Amelia Sharman
Categories: Science
Type: BOOK - Published: 2019 - Publisher: Bridget Williams Books

GET EBOOK

‘I am 29 years old. I was born just before the Kyoto Protocol was signed, and since then global mean temperatures have risen by an estimated 0.2°C per decade
Machine Learning with TensorFlow, Second Edition
Language: en
Pages: 454
Authors: Mattmann A. Chris
Categories: Computers
Type: BOOK - Published: 2021-02-02 - Publisher: Manning Publications

GET EBOOK

Updated with new code, new projects, and new chapters, Machine Learning with TensorFlow, Second Edition gives readers a solid foundation in machine-learning con