Research Article

Infobright Enterprise Edition Analytic Data Warehouse Technology An overview

Published on March 2012 by Juned A. Khan, Ajit P. Shiralkar
2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
Foundation of Computer Science USA
NCIPET - Number 15
March 2012
Authors: Juned A. Khan, Ajit P. Shiralkar

Juned A. Khan, Ajit P. Shiralkar. Infobright Enterprise Edition Analytic Data Warehouse Technology An overview. 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013). NCIPET, 15 (March 2012), 36-40.

@article{
author = { Juned A. Khan and Ajit P. Shiralkar },
title = { Infobright Enterprise Edition Analytic Data Warehouse Technology An overview },
journal = { 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013) },
issue_date = { March 2012 },
volume = { NCIPET },
number = { 15 },
month = { March },
year = { 2012 },
issn = {0975-8887},
pages = { 36-40 },
numpages = 5,
url = { /proceedings/ncipet/number15/5309-1121/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%A Juned A. Khan
%A Ajit P. Shiralkar
%T Infobright Enterprise Edition Analytic Data Warehouse Technology An overview
%J 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
%@ 0975-8887
%V NCIPET
%N 15
%P 36-40
%D 2012
%I International Journal of Computer Applications
Abstract

Over the last few decades, business intelligence has emerged as one of the highest-priority items on CIO agendas. Businesses and government agencies know that mining information from the increasingly large volumes of data they collect is critical to their business or mission. During this same period, a number of other factors have contributed to the high rate of growth of business intelligence (BI) and data warehousing (DW) technologies, including:
• Many more users with diverse needs
• Need for ad hoc queries vs. standard canned reports
• Need for more “real time” information
• Growth of the number of databases within an organization, with need for consolidation of information
• Rapidly growing volumes of data
• Growth of internet and web-based applications, including self-service applications
• Regulatory/legal requirements

Traditional approaches to data warehousing have significant drawbacks in terms of effectively delivering a solution to the business for such diverse requirements. These drawbacks include high licensing and storage cost, slow query performance against large data volumes, and difficulty in providing access to all of the data stored in multiple databases. At the same time, these growing issues are placing a high burden on DBAs and IT organizations to implement, tune, and manage databases that supply BI information. The result is that business users are frustrated with how long it takes to get critical analytical information needed for the success of the business.

As the need for BI and DW has grown, various new products and technologies have been introduced to address different needs. Many are best suited for workloads that consist of a high volume of planned, repetitive reports and queries. An example of such an application would be a data warehouse used to support a retail call center. Each time a customer calls, the system calls up his or her account. This is a repetitive OLTP-like query that benefits from a system specifically designed and engineered to optimize the performance of these queries. Data warehouses using a traditional index-based architecture are well suited to this workload.

But another growing area for data warehousing and BI is analytics. Examples include marketing, finance, sales, compliance, risk management, or operations groups performing ad hoc queries such as: “How did a particular 2007 Christmas sales campaign perform compared to our 2006 campaign?” or “Let’s analyze why there are more mortgage defaults in this area over the last 12 months versus the last five years.” The ad hoc nature and diversity of these requests make row-oriented, index-based architectures a poor choice for an analytical data warehouse. By definition, DBAs don’t know what users will need in the future and are therefore unable to determine what indexes to create. Adding an index also increases both the size of the database and the time needed to load data.

Index Terms

Computer Science
Information Sciences

Keywords

Infobright, Optimizer, Data Packs, Data Pack Nodes