Director, Information Retrieval and Data Science Group and Adjunct Associate Professor, USC and Principal Data Scientist, NASA
I am a Principal Data Scientist and the Chief Architect in the Instrument and Data Systems section, at the Jet Propulsion Laboratory (JPL) in Pasadena, California. I also am Director of the Information Retrieval and Data Science Group (IRDS) and Adjunct Associate Professor in the Computer Science Department within USC's Viterbi School of Engineering.
At USC, I teach CSCI 599: Content Detection and Analysis for Big Data, a new course in the Data Science track. I used to teach CSCI 572: Information Retrieval and Search Engines and CSCI 578: Software Architectures.
I wrote the Tika in Action book with Jukka Zitting and published by Manning Publications. Tika in Action is the definitive guide to a popular software framework for content detection and analysis that I co-invented (with Jérôme Charron) called Apache Tika.

Searching deep and dark: Building a Google for the less visible parts of the web
Jan 09, 2017 05:21 am UTC| Technology
In todays data-rich world, companies, governments and individuals want to analyze anything and everything they can get their hands on and the World Wide Web has loads of information. At present, the most easily indexed...