Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover

Author : Joseph Dain
Publisher : IBM Redbooks
Total Pages : 108
Release : 2020-08-11
ISBN-10 : 073845902X
ISBN-13 : 9780738459028

Book Synopsis: Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover, by Joseph Dain

Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover was written by Joseph Dain and published by IBM Redbooks. It was released on 2020-08-11 and has 108 pages.

Book excerpt: This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for Data (IBM CP4D) to make the enriched catalog content in IBM Spectrum Discover, along with the associated data, available in WKC and IBM CP4D. From an end-to-end IBM solution point of view, IBM CP4D and WKC provide state-of-the-art data governance, collaboration, artificial intelligence (AI), and analytics tools. IBM Spectrum Discover complements these features by adding support for unstructured data on large-scale file and object storage systems, both on premises and in the cloud.

Many organizations struggle to manage unstructured data. Common challenges include:

- Pinpointing and activating relevant data for large-scale analytics, machine learning (ML), and deep learning (DL) workloads.
- Lacking the fine-grained visibility that is needed to map data to business priorities.
- Removing redundant, obsolete, and trivial (ROT) data and identifying data that can be moved to a lower-cost storage tier.
- Identifying and classifying sensitive data as it relates to various compliance mandates, such as the General Data Protection Regulation (GDPR), the Payment Card Industry Data Security Standard (PCI DSS), and the Health Insurance Portability and Accountability Act (HIPAA).

This paper describes how IBM Spectrum Discover integrates data in IBM Storage with IBM Watson Knowledge Catalog (WKC). Features include:

- Event-based cataloging and tagging of unstructured data across the enterprise.
- Automatically inspecting and classifying more than 1,000 unstructured data types, including genomics-specific and imaging-specific file formats.
- Automatically registering assets with WKC based on IBM Spectrum Discover search and filter criteria, and using those assets in IBM CP4D.
- Enforcing data governance policies in WKC in IBM CP4D based on insights from IBM Spectrum Discover, and using those assets in IBM CP4D.

Several in-depth use cases show examples from healthcare, life sciences, and financial services. IBM Spectrum Discover integration with WKC enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of data. The integration improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.


Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover Related Books

Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover
Language: en
Pages: 108
Authors: Joseph Dain
Categories: Computers
Type: BOOK - Published: 2020-08-11 - Publisher: IBM Redbooks

This IBM® Redpaper publication explains how IBM Spectrum® Discover integrates with the IBM Watson® Knowledge Catalog (WKC) component of IBM Cloud® Pak for D
Cataloging Unstructured Data in IBM Watson Knowledge Catalog with IBM Spectrum Discover
Language: en
Pages:
Authors: Joseph Dain
Categories: Database management
Type: BOOK - Published: 2020 - Publisher:

IBM Spectrum Discover: Metadata Management for Deep Insight of Unstructured Storage
Language: en
Pages: 152
Authors: Joseph Dain
Categories: Computers
Type: BOOK - Published: 2019-10-01 - Publisher: IBM Redbooks

This IBM® Redpaper publication provides a comprehensive overview of the IBM Spectrum® Discover metadata management software platform. We give a detailed expla
Making Data Smarter with IBM Spectrum Discover: Practical AI Solutions
Language: en
Pages: 170
Authors: Ivaylo B. Bozhinov
Categories: Computers
Type: BOOK - Published: 2020-10-19 - Publisher: IBM Redbooks

More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social
IBM Spectrum Discover
Language: en
Pages:
Authors: Joe Dain
Categories: Database management
Type: BOOK - Published: 2019 - Publisher:
