2 min

IBM and Trifacta have jointly created a tool to prepare data for the use of artificial intelligence (AI) and machine learning models. The tool has InfoSphere Advanced Data Preparation and should speed up the preparation of data for self-service analytics.

According to Big Blue and Trifacta, data preparation is an essential step in building machine learning and predictive models, writes Silicon Angle. This is because the data must be extremely accurate, otherwise the models will not be effective. The problem is that data scientists can spend up to 80 percent of their time on this task. The two companies believe that it would be better to spend that time on something else. InfoSphere Advanced Data Preparation should help with this.

InfoSphere Advanced Data Preparation allows data scientists to convert their raw data sets into a format suitable for machine learning models. They can work with their existing data lakes and data warehouses. The tool is designed for formatting, structuring and enriching the datasets for analytical processing and standard reports.

InfoSphere Advanced Data Preperation works by helping users visualize the process of preparing the data so that they can constantly monitor the quality of their data and ensure that there are no formatting errors. This process is fully automated. This allows not only data scientists to prepare and enrich the data for analytical purposes, but also ordinary employees to do so.

Cloud Pak for Data

IBM also announced several updates to its Cloud Pak for Data service. This service is designed to integrate, oversee and manage data across multiple public and private clouds. The updates will make Watson Knowledge Catalog Professional on Cloud Pak for Data available. This should improve the findability of data for analysis and make more governance tools available.

DatStage Edition for Cloud Pak for Data has also been added. This update is intended to reduce latency in data transformations. Watson Discovery for Cloud Pak for Data is an AI search tool for discovering data across multiple clouds.

This news article was automatically translated from Dutch to give Techzine.eu a head start. All news articles after September 1, 2019 are written in native English and NOT translated. All our background stories are written in native English as well. For more information read our launch article.