Document Type : Research Article

Authors

Technical University of Munich, Chair for Information Systems (i17), Boltzmannstr. 3, 85748 Garching by Munich

Abstract

Data Virtualization (DV) has become an important method to store and handle data cost-efficiently. However, it is unclear what kind of data and when data should be virtualized or not. We applied a design science approach in the first stage to get a state of the art of DV regarding data integration and to present a concept matrix. We extend the knowledge base with a systematic literature review resulting in 15 critical success factors for DV. Practitioners can use these critical success factors to decide between DV and Extract, Transform, Load (ETL) as data integration approach.

Keywords

[1] JaneWebster and Richard T.Watson. Analyzing the past to prepare for the future: Writing a literature review. MIS Quarterly, 26(2):xiii–xxiii,2002.
[2] Alan R Hevner, Salvatore T March, Jinsoo Park, and Sudha Ram. Design science in information systems research. MIS Quarterly, 28(1):75–105, 2004.
[3] N. Yuhanna. The forrester wave: Enterprise data virtualization, q4 2017: The 13 vendors that matter most and how they stack up. Forrester,2017.
[4] Denodo. Data virtualization and etl. 2014.
[5] N. D. Bhatti. Overcoming data challenges with virtualization. Business Intelligence Journal,(Vol. 48, No. 4), 2013.
[6] A. R. Bologa and R. Bologa. A perspective on the benefits of data virtualization technology. Informatica Economica, (Vol. 15, No. 4):110âAS118,2011.
[7] R. F. Van der Lans. Data virtualization for business intelligence systems: Revolutionizing data integration for data warehouses. The Morgan Kaufmann Series on Business Intelligence. Elsevier/ Morgan Kaufmann, Amsterdam, 2012.
[8] S. Vinay. Logical data warehousing for big data: Extracting value from the data! Gartner, 2012.
[9] P. Moxon. Data integration alternatives. 2015.
[10] T. Grosser and N. Janoschek. Datenmanagement im wandel: Data warehousing und datenintegration
im zeitalter von self service und big data.2014.
[11] M. Voet. Data virtualization is a revenue generator,2018.
[12] J. E. Powell. Enabling bi agility with data virtualization. Business Intelligence Journal, (Vol.16, No. 4):53âAS55, 2011.
[13] S. Earley. Data virtualization and digital agility. IT Professional, 18(5):70âAS72, 2016.
[14] F. Farooq. The data warehouse virtualization framework for operational business intelligence. Expert Systems, 30(5):451âAS472, 2013.
[15] R. F. Van der Lans. Developing a bi-modal logical data warehouse architecture using data virtualization: A whitepaper. R20/Consultancy,2016.
[16] M. Ferguson. Succeeding with data virtualization: High value use cases for operational and data management data services. 2011.
[17] R. Shankar. Enabling self-service bi with a logical data warehouse. Business Intelligence Journal,
(Vol. 22, No. 3), 2017.
[18] Denodo. Deploying data virtualization at an enterprise scale âAS a journey towards an agile, data-driven infrastructure. 2017.
[19] R. Kimball. The data warehouse toolkit: The definitive guide to dimensional modeling. J. Wiley & Sons, [Erscheinungsort nicht ermittelbar], 3rd ed. edition, 2013.
[20] A. Chandramouly, N. Patil, R. Ramamurthy, S. R. Krishnan, and J. Story. Integrating data warehouses with data virtualization for bi agility.2013.
[21] Data Virtuality. Der komplette guide zur datenintegration: Einfache datenintegration im digitalen zeitaler e-book. 2014.
[22] Denodo and IBM. Achieve value and insight with ibm big data analytics and denodo data virtualization. 2014.
[23] Denodo. Data virutalization goes mainstream: Solving key data integration challenges with more agility than traditional technologies for structured, unstructured, web, cloud and big data sources. 2014.
[24] Denodo. Data virtualization usage patterns for business intelligence/data warehouse architectures. 2016.
[25] Denodo. Denodo platform 6.0. 2016.
[26] Denodo. Die 10 hÃdufigsten fragen: Datenvirtualisierung.2016.
[27] Denodo. Realizing the promise of self-service analytics. 2017.
[28] Denodo. Overcoming telecommunications challenges with data virtualization. 2018.
[29] M. Ferguson. Succeeding with data virtualization: High value use cases for analytical data services. 2011.
[30] M. Goetz and N. Yuhanna. Create a road map for a real-time, agile, self-service data platform: Road map: The data management playbook. Forrester,2015.
[31] S. S. Guo, Z. M. Yuan, A. B. Sun, and Q. Yue. A new etl approach based on data virtualization. Journal of Computer Science and Technology, 30(2):311âAS323, 2015.
[32] B. Hopkins. Data virtualization reaches critical mass: Technology advancements, new patterns, and customer successes make this enterprise technology both a short- and long-term solution. Forrester, 2011.
[33] D. Loshin. Effecting data quality improvement  through data virtualization. 2010.
[34] M. Matzer and C. Kurze. Datenvirtualisierung: Bindeglied zwischen verteilten datensilos zum aufbau flexibler analytischer ÃUkosysteme. 2017.
[35] A. H. Mousa and N. Shiratuddin. Data warehouse and data virtualization: Comparative study. page 369âAS372, 2015.
[36] P. Russom. Data integration for real-time data warehousing and data virtualization. TDWI,2010.
[37] M. Schroeck, R. Shockley, J. Smart, D. Romero-Morales, and P. Tufano. Analytics: Big data in der praxis: Wie innovative unternehmen ihre datenbestÃdnde effektiv nutzen. 2012.
[38] TIBCO. Applying data virtualization: 13 usecases that matter. 2017.
[39] TIBCO. Ten things you need to know about data virtualization. 2017.
[40] TIBCO. Tibco data virtualization technical overview: Tibco data virtualization deployment, development, run-time, and management capabilities.2017.
[41] TIBCO. Data virtualization: Achieve better business outcomes more quickly. 2018.
[42] R. F. Van der Lans. Designing a data virtualization environment a step-by-step- approach: A technical whitepaper. 2016.
[43] R. F. Van der Lans. Designing a logical data warehouse: A technical whitepaper. 2016.
[44] R. F. Van der Lans. Data virtualization in the time of big data: A technical whitepaper. 2017.
[45] R. F. Van der Lans. Architecting the multipurpose data lake with data virtualization. 2018.
[46] N. Yuhanna and M. Gilpin. The forrester wave: Data virtualization, q1 2012. Forrester, 2012.