Chapter 3 of 'IT for Management' discusses data management, business intelligence, and data analytics, highlighting the importance of databases, data warehouses, and data marts in storing and analyzing business data. It emphasizes the role of Business Intelligence (BI) in decision-making and the need for data quality and management to enhance analytics. The chapter also covers the challenges and architecture of BI, as well as the significance of electronic records management in supporting business processes.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
4 views45 pages
CH 03
Chapter 3 of 'IT for Management' discusses data management, business intelligence, and data analytics, highlighting the importance of databases, data warehouses, and data marts in storing and analyzing business data. It emphasizes the role of Business Intelligence (BI) in decision-making and the need for data quality and management to enhance analytics. The chapter also covers the challenges and architecture of BI, as well as the significance of electronic records management in supporting business processes.
Database Technologies: Data Warehouses • Integrate data from multiple databases and data silos, and organize them for complex analysis, knowledge discovery, and to support decision making • May require formatting processing and/or standardization • Loaded at specific times making them non-volatile and ready for analysis
Database Technologies: Data Marts • Small-scale data warehouses that support a single function or one department • Enterprises that cannot afford to invest in data warehousing may start with one or more data marts
Database Technologies: BI • Business Intelligence (BI) o Tools and techniques that process data and conduct statistical analysis for insight and discovery o Used to discover meaningful relationships in the data, keep informed of real time, detect trends, and identify opportunities and risks
Database Management Systems (DBMS) • Integrate with data collection systems such as TPS and business applications • Organized way to store, access, and manage data • Stores data in tables consisting of columns and rows, similar to the format of a spreadsheet • Standard database model adopted by most enterprises • Functions include: o Data filtering and profiling o Data integrity and maintenance o Data synchronization o Data security o Data access
Database Technologies: SQL • Relational Database Management Systems (DBMS) o Provides access to data using a declarative language • Declarative language o Simplifies data access by requiring that users only specify what data they want to access without defining how they will be achieved o Structured Query Language (SQL) is an example of declarative language: SELECT column_name(s) FROM table_name WHERE condition
OLTP and OLAP Systems Online Transaction Processing and Online Analytics Processing
• Online Transaction Processing (OLTP)
• Designed to manage transaction data, which are volatile & break down complex information into simpler data tarbles and strike a balance between transaction-processing efficiency and query efficiency • Cannot be optimized for data mining • Online Analytics Processing (OLAP) • A means of organizing large business databases • Divided into one or more cubes that fit the way business is conducted
Database Technologies: NOSQL • Trend toward NoSQL Systems o Higher performance o Easy distribution of data on different nodes • Enables scalability and fault tolerance o Greater flexibility o Simpler administration
Data Management and Database Technologies 1. Describe a database and database management system (DBMS). 2. Explain what an online transaction-processing (OLAP) system does. 3. Why are data in databases volatile? 4. Describe the functions of a DBMS. 5. Describe the purpose and benefits of data management. 6. What is a relational database management system?
Characteristics of Poor Quality or Dirty Data Characteristic Description Incomplete Missing data Outdated or Invalid Too old to be valid or useful Incorrect Too many errors Duplicated or in Too many copies or versions of the same data—and the versions conflict are inconsistent or in conflict with each other Non-standardized Data are stored in incompatible formats—and cannot be compared or summarized
Unusable Data are not in context to be understood or interpreted correctly
Data Life Cycle and Data Principles (1 of 2) • Principle of Diminishing Data Value o The value of data diminishes as they age o Blind spots (lack of data availability) of 30 days or longer inhibit peak performance o Global financial services institutions rely on near-real-time data for peak performance • Principle of 90/90 Data Use o As high as 90 percent, is seldom accessed after 90 days (except for auditing purposes) o Roughly 90 percent of data lose most of their value after 3 months
Data Life Cycle and Data Principles (2 of 2) • Principle of data in context o The capability to capture, process, format, and distribute data in near real time or faster requires a huge investment in data architecture o The investment can be justified on the principle that data must be integrated, processed, analyzed, and formatted in “actionable information”
Centralized and Distributed Database Architectures 1. Describe the data life cycle. 2. What is the function of master data management (MDM)? 3. What are the consequences of not cleaning “dirty data”? 4. Describe the differences between centralized and distributed databases. 5. Discuss how data ownership and organizational politics affect the quality of an organization’s data.
Data Preparation: Procedures to Prepare EDW Data for Analytics • Extract from designated databases • Transform by standardizing formats, cleaning the data, integration • Loading into a data warehouse
Data Warehouses: ADW • Active Data Warehouse (ADW) o Real-time data warehousing and analytics o Transform by standardizing formats, cleaning the data, integration • They provide o Interaction with a customer to provide superior customer service o Respond to business events in near real time o Share up-to-date status data among merchants, vendors, and associates
Data Warehouse Processing: Hadoop and MapReduce • Hadoop is an Apache processing platform that places no conditions on the processed data structure • MapReduce provides a reliable, fault-tolerant software framework to write applications easily that process vast amounts of data (multi-terabyte datasets) in-parallel on large clusters (thousands of nodes) of commodity software o Map stage: breaks up huge data into subsets o Reduce stage: recombines partial results
Data Warehouses 1. What are the differences between databases and data warehouses? 2. What are the differences between data warehouses and data marts? 3. Explain ETL. 4. Explain CDC. 5. What is an advantage of an active data warehouse (ADW)? 6. Why might a company invest in a data mart? 7. How can manufacturers and health care benefit from data analytics? 8. Explain how Hadoop implements MapReduce in two stages.
Four V’s of Data Analytics 1. Variety: The analytic environment has expanded from pulling data from enterprise systems to include big data and unstructured sources. 2. Volume: Large volumes of structured and unstructured data are analyzed. 3. Velocity: Speed of access to reports that are drawn from data defines the difference between effective and ineffective analytics. 4. Veracity: Validating data and extracting insight that manager and workers can trust are key factors successful analytics. Trust in analytics. Trust analytics has grown more difficult with the explosion of data sources.
Data Analytics: Human Expertise is Needed • To interpret the output of analytics, Big Data Specialists and Business Intelligence Analysts perform many tasks o Data preparation for analysis through data cleansing techniques, to eliminate duplicates or incomplete data o Dirty data degrade the value of analytics o Data must be put into meaningful context
o Data Mining: software that enables users to analyze data from various dimension or angles, categorize them, and find correlative patterns among fields in the data warehouse o Text Mining: broad category involving interpreted words and concepts in context o Sentiment Analysis: trying to understand consumer intent
Data Analytics and Data Discovery 1. Why are human expertise and judgment important to data analytics? Give an example. 2. What is the relationship between data quality and the value of analytics? 3. Why do data need to be put into a meaningful context? 4. How can manufacturers and health care benefit from data analytics? 5. How does data mining provide value? Give an example. 6. What is text mining? ? 7. What are the basic steps involved in text analytics?
Electronic Records Management • Business Records o Documentation of a business event, action, decision, or transaction • Electronic Records Management (EMR) o Workflow software, authoring tools, scanners, and databases that manage and archive electronic documents and image paper documents o Index and store documents according to company policy or legal compliance o Success depends on partnership of key players
ERM Practices and Standards • Best Practices o Effective systems capture all business data o Input from online forms, bar codes, sensors, websites, social sites, copiers, emails, and more • Industry Standards o Association for Information and Image Management (AIIM; www.aim.org) o National Archives and Records Administration (NARA; www.archives.gov) o ARMA International (formerly the Association of Records Managers and Administrators; www.arma.org)
ERM Benefits: an ERM can help a business • Access and use the content contained in documents • Cut labor costs by automating business processes • Reduce time and effort to locate require information for decision making • Improve content security, thereby reducing intellectual property theft risks • Minimize content printing, storing, and searching costs
ERM: Disaster Recovery, Business Continuity, and Compliance 1. Does the software meet the organization’s needs? For example, can the DMS be installed on the existing network? Can it be purchased as a service? 2. Is the software easy to use and accessible from Web browsers, office applications, and email applications? If not, people will not use it. 3. Does the software have lightweight, modern Web and graphical user interfaces that effectively support remote users? 4. Before selecting a vendor, it is important to examine workflows and how data, documents, and communications flow throughout the company.