Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover
Joseph Dain, Abeer Selim, Anil Patil, Christopher Vollmar, Flavio de Rezende, Frank Greco, Frank N. Lee, Isom Crawford Jr., Ivaylo B. Bozhinov, Joanna Wong, Joshua Blumert, Larry Coyne, IBM Redbooks
IBM Redbooks, Aug 11, 2020 - Computers - 108 pages
This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for Data (IBM CP4D) to make the enriched catalog content in IBM Spectrum Discover, along with the associated data, available in WKC and IBM CP4D. From an end-to-end IBM solution point of view, IBM CP4D and WKC provide state-of-the-art data governance, collaboration, artificial intelligence (AI), and analytics tools. IBM Spectrum Discover complements these features by adding support for unstructured data on large-scale file and object storage systems, both on premises and in the cloud.
Many organizations struggle to manage unstructured data. Challenges that companies face include:
This paper describes how IBM Spectrum Discover provides seamless integration of data in IBM Storage with IBM Watson Knowledge Catalog (WKC). Features include:
Several in-depth use cases illustrate the integration with examples from healthcare, life sciences, and financial services.
IBM Spectrum Discover integration with WKC enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of data. The integration improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.