My perspective on features that would benefit self-service data preparation

admin | March 21st, 2016

What are some useful features that would benefit users of self-service data preparation offerings? Here is my list:

  • Tighter integration with Business Intelligence tools; the BI user should be able to drill down into the Data Preparation pipeline from BI tools to further understand the underlying transformations and explore the data.
  • Scalability of the infrastructure behind the data preparation process, including the ability to add nodes to scale out the preparation process.
  • Holistic view of the data preparation processes and pipelines. Let's say that two business units have their own distinct instances of data preparation software. How can they gain a view into the pipelines used by each other, share pipelines with each other, or even call one pipeline from another? A holistic view will provide opportunities to unify things across an organization rather than creating siloed pipelines.
  • Ability to support content-oriented data sources such as PDF.
  • Ability to prepare JSON data in an optimized fashion.
  • Eliminating security concerns around moving the data into the data preparation platform as part of the preparation processing. It is important to ensure that the data is secured at all times.
  • Support for NoSQL data sources.
  • On-premises offerings in addition to cloud offerings.
  • Leveraging in-database processing as appropriate.
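
To make the last item on the list a bit more concrete, here is a minimal sketch of what in-database processing means in practice. It uses Python's built-in `sqlite3` module purely as a stand-in for a real source database. The `orders` table, its columns, and the function names are hypothetical and only there for illustration. The first function pulls every row into the preparation tool and aggregates there; the second pushes the same filter and aggregation down to the database, so only the summarized result moves.

```python
import sqlite3


def build_demo_db() -> sqlite3.Connection:
    """Create an in-memory database with a small, made-up orders table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (region TEXT, amount REAL)")
    conn.executemany(
        "INSERT INTO orders (region, amount) VALUES (?, ?)",
        [("east", 10.0), ("east", 15.0), ("west", 7.5), ("west", None)],
    )
    return conn


def totals_client_side(conn: sqlite3.Connection) -> dict:
    """Fetch every row, then clean and aggregate outside the database."""
    totals = {}
    for region, amount in conn.execute("SELECT region, amount FROM orders"):
        if amount is None:  # drop rows with a missing amount
            continue
        totals[region] = totals.get(region, 0.0) + amount
    return totals


def totals_in_database(conn: sqlite3.Connection) -> dict:
    """Let the database filter and aggregate; only one row per region comes back."""
    rows = conn.execute(
        "SELECT region, SUM(amount) FROM orders "
        "WHERE amount IS NOT NULL GROUP BY region"
    )
    return dict(rows)


if __name__ == "__main__":
    conn = build_demo_db()
    print(totals_client_side(conn))
    print(totals_in_database(conn))
```

Both functions return the same totals. The difference is where the work happens and how much data has to leave the database, which also ties back to the earlier point about security concerns around moving data into the preparation platform.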
