managing data resources file organization and databases
TRANSCRIPT
![Page 1: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/1.jpg)
Managing Data Managing Data ResourcesResources
Managing Data Managing Data ResourcesResources
File Organization and databases
![Page 2: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/2.jpg)
Problems with the Traditional File Environment
Data Redundancy and Inconsistency:
• Data redundancy: The presence of duplicate data in multiple data files so that the same data are stored in more than one place or location
• Data inconsistency: The same attribute may have different values.
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
![Page 3: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/3.jpg)
• The coupling of data stored in files and the specific programs required to update and maintain those files such that changes in programs require changes to the data
Lack of flexibility: • A traditional file system can deliver routine
scheduled reports after extensive programming efforts, but it cannot deliver ad-hoc reports or respond to unanticipated information requirements in a timely fashion.
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Program-data dependence:
Problems with the Traditional File Environment
![Page 4: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/4.jpg)
• Because there is little control or management of data, management will have no knowledge of who is accessing or even making changes to the organization’s data.
Lack of data sharing and availability: • Information cannot flow freely across different
functional areas or different parts of the organization. Users find different values of the same piece of information in two different systems, and hence they may not use these systems because they cannot trust the accuracy of the data.
ORGANIZING DATA IN A TRADITIONAL FILE ENVIRONMENT
Poor security:Problems with the Traditional File Environment
![Page 5: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/5.jpg)
The Contemporary Database EnvironmentTHE DATABASE APPROACH TO DATA MANAGEMENT
![Page 6: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/6.jpg)
Components of DBMS:
• Data definition language: Specifies content and structure of database and defines each data element
• Data manipulation language: Used to process data in a database
• Data dictionary: Stores definitions of data elements and data characteristics
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 7: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/7.jpg)
Sample Data Dictionary ReportTHE DATABASE APPROACH TO DATA MANAGEMENT
![Page 8: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/8.jpg)
Types of Databases:
• Relational DBMS
• Hierarchical and network DBMS
• Object-oriented databases
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 9: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/9.jpg)
Relational DBMS:
• Represents data as two-dimensional tables called relations
• Relates data across tables based on common data element
• Examples: DB2, Oracle, MS SQL Server
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 10: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/10.jpg)
The Relational Data ModelTHE DATABASE APPROACH TO DATA MANAGEMENT
![Page 11: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/11.jpg)
Three Basic Operations in a Relational Database:
• Select: Creates subset of rows that meet specific criteria
• Join: Combines relational tables to provide users with information
• Project: Enables users to create new tables containing only relevant information
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 12: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/12.jpg)
The Three Basic Operations of a Relational DBMSTHE DATABASE APPROACH TO DATA MANAGEMENT
![Page 13: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/13.jpg)
Hierarchical and Network DBMS Hierarchical and Network DBMS
• Organizes data in a tree-like structure
• Supports one-to-many parent-child relationships
• Prevalent in large legacy systems
THE DATABASE APPROACH TO DATA MANAGEMENT
Hierarchical DBMS:
![Page 14: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/14.jpg)
A Hierarchical Database for a Human Resources System
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 15: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/15.jpg)
Hierarchical and Network DBMS Hierarchical and Network DBMS
• Depicts data logically as many-to-many relationships
THE DATABASE APPROACH TO DATA MANAGEMENT
Network DBMS:
![Page 16: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/16.jpg)
The Network Data Model
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 17: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/17.jpg)
Hierarchical and Network DBMS Hierarchical and Network DBMS
• Outdated
• Less flexible compared to RDBMS
• Lack support for ad-hoc and English language-like queries
Disadvantages:
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 18: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/18.jpg)
• Object-oriented DBMS: Stores data and procedures as objects that can be retrieved and shared automatically
• Object-relational DBMS: Provides capabilities of both object-oriented and relational DBMS
Object-Oriented Databases:
THE DATABASE APPROACH TO DATA MANAGEMENT
![Page 19: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/19.jpg)
• Conceptual design: Abstract model of database from a business perspective
• Physical design: Detailed description of business information needs
CREATING A DATABASE ENVIRONMENT
Designing Databases:
![Page 20: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/20.jpg)
• Entity-relationship diagram: Methodology for documenting databases illustrating relationships between database entities
• Normalization: Process of creating small stable data structures from complex groups of data
CREATING A DATABASE ENVIRONMENT
Designing Databases
![Page 21: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/21.jpg)
An Unnormalized Relation for ORDER
CREATING A DATABASE ENVIRONMENT
![Page 22: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/22.jpg)
Normalized Tables Created from ORDERCREATING A DATABASE ENVIRONMENT
![Page 23: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/23.jpg)
An Entity-Relationship DiagramCREATING A DATABASE ENVIRONMENT
![Page 24: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/24.jpg)
Centralized database:
• Used by single central processor or multiple processors in client/server network
• There are advantages and disadvantages to having all corporate data in one location.
• Security is higher in central environments, risks lower.
• If data demands are highly decentralized, then a decentralized design is less costly, and more flexible.
CREATING A DATABASE ENVIRONMENTDistributing Databases Distributing Databases
![Page 25: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/25.jpg)
• Databases can be decentralized either by
partitioning or by replicating
• Partitioned database: Database is divided
into segments or regions. For example, a
customer database can be divided into
Eastern customers and Western customers,
and two separate databases maintained in
the two regions.
CREATING A DATABASE ENVIRONMENT
Distributed database:
![Page 26: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/26.jpg)
• Duplicated database: The database is completely duplicated at two or more locations. The separate databases are synchronized in off hours on a batch basis.
• Regardless of which method is chosen, data administrators and business managers need to understand how the data in different databases will be coordinated and how business processes might be effected by the decentralization.
CREATING A DATABASE ENVIRONMENT
![Page 27: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/27.jpg)
Distributed DatabasesCREATING A DATABASE ENVIRONMENT
![Page 28: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/28.jpg)
Ensuring Data Quality: • Corporate and government databases
have unexpectedly poor levels of data quality.
• National consumer credit reporting databases have error rates of 20-35%.
• 32% of the records in the FBI’s Computerized Criminal History file are inaccurate, incomplete, or ambiguous.
• Gartner Group estimates that consumer data in corporate databases degrades at the rate of 2% a month.
CREATING A DATABASE ENVIRONMENT
Management Information SystemsManagement Information SystemsManaging Data Resources Managing Data Resources
![Page 29: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/29.jpg)
• The quality of decision making in a firm is directly related to the quality of data in its databases.
• Data Quality Audit: Structured survey of the accuracy and level of completeness of the data in an information system
• Data Cleansing: Consists of activities for detecting and correcting data in a database or file that are incorrect, incomplete, improperly formatted, or redundant
CREATING A DATABASE ENVIRONMENTEnsuring Data Quality
![Page 30: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/30.jpg)
Online Analytical Processing (OLAP):
• Multidimensional data analysis
• Supports manipulation and analysis of large volumes of data from multiple dimensions/perspectives
DATABASE TRENDS Multidimensional Data Analysis Multidimensional Data Analysis
![Page 31: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/31.jpg)
Data warehouse:
• Supports reporting and query tools
• Stores current and historical data
• Consolidates data for management analysis and decision making
DATABASE TRENDS
Data Warehousing and Data Mining Data Warehousing and Data Mining
Management Information SystemsManagement Information SystemsManaging Data Resources Managing Data Resources
![Page 32: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/32.jpg)
Components of a Data WarehouseDATABASE TRENDS
![Page 33: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/33.jpg)
Data mart: • Subset of data warehouse
• Contains summarized or highly focused portion of data for a specified function or group of users
DATABASE TRENDS
Data mining:• Tools for analyzing large pools of data
• Find hidden patterns and infer rules to predict trends
![Page 34: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/34.jpg)
Data warehouseA relational database managementsystem designed specifically to support management decision makingCurrent evolution of Decision Support Systems (DSSs)
Data mart A subset of a data warehouse for smalland medium-size businesses or departments within larger companies
DATABASE TRENDS
![Page 35: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/35.jpg)
Designing a Customer Data Warehouse
Sharply define your goals and objectives before you build the warehouse
Choose the software that best fits your goalsDetermine who/what should be in the databaseDevelop a planMeasure results
DATABASE TRENDS
![Page 36: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/36.jpg)
Data Mining Applications
Data mining The automated discovery of patterns and relationships in a
data warehouseData mining applications
• Market segmentation
• Customer queries
• Fraud detection
• Direct marketing
• Market basket analysis
• Trend analysis
DATABASE TRENDS
![Page 37: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/37.jpg)
Relationaldatabases
Hierarchicaldatabases
Networkdatabases
Flat files
Spreadsheets
Dataextractionprocess
Query andanalysis
tools
Datawharehouse
Datacleanupprocess
![Page 38: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/38.jpg)
The Web and Hypermedia database:
• Organizes data as network of nodes
• Links nodes in pattern specified by user
• Supports text, graphic, sound, video, and executable programs
DATABASE TRENDS
Databases and the Web Databases and the Web
![Page 39: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/39.jpg)
Database server:
• Computer in a client/server environment
runs a DBMS to process SQL statements
and perform database management tasks.
Application server:
• Software handling all application operations
DATABASE TRENDS
Databases and the Web
![Page 40: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/40.jpg)
Linking Internal Databases to the Web
DATABASE TRENDS
![Page 41: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/41.jpg)
Management Opportunities:
MANAGEMENT OPPORTUNITIES, CHALLENGES, AND SOLUTIONS
Business firms have exceptional Business firms have exceptional
opportunities to exploit modern opportunities to exploit modern
relational database technologies to relational database technologies to
improve decision making, and to improve decision making, and to
increase the efficiency of their increase the efficiency of their
business processes. business processes.
![Page 42: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/42.jpg)
MANAGEMENT OPPORTUNITIES, CHALLENGES, AND SOLUTIONS
Management Challenges:
• Organizational obstacles to a database environment Need for cooperation in developing corporate-wide data administration
• Cost/benefit considerations
Bringing about significant change in the database environment of a firm can be very expensive and time consuming.
![Page 43: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/43.jpg)
Solution Guidelines:
MANAGEMENT OPPORTUNITIES, CHALLENGES, AND SOLUTIONS
The critical elements for creating a database environment are:
• Data administration
• Data-planning and modeling methodology
• Database technology and management
• Users
![Page 44: Managing Data Resources File Organization and databases](https://reader035.vdocuments.us/reader035/viewer/2022070404/56649f395503460f94c56984/html5/thumbnails/44.jpg)
Key Organizational Elements in the Database Environment
MANAGEMENT OPPORTUNITIES, CHALLENGES, AND SOLUTIONS