Skip to content

Latest commit

 

History

History
485 lines (375 loc) · 17 KB

File metadata and controls

485 lines (375 loc) · 17 KB

Designing and Testing a Relational Database

Author

Justin Ferguson GitHub

Justin Ferguson LinkedIn

Contact


Other Versions

HTML Report

Background

The Happy Phone Company is a fictitious retailer of various makes and models of mobile phones. The business requires a new database to help manage data concerning employees, customers, suppliers, phones and contracts.

Business Rules

The database must satisfy the following business rules:

  • Information about The Happy Phone Company employees includes an employee ID, name and position.

  • Each customer of The Happy Phone Company is identified by a customer ID and has a name.

  • Suppliers to The Happy Phone Company have a unique name and contact phone number.

  • Phones stocked by The Happy Phone Company are described by a product code, retail price and stock quantity.

  • Details of phones need to be kept, including make and model information.

  • Contracts supplied by The Happy Phone Company are described by a contract number, contract term in months and contract price.

  • A phone must have at least one supplier and each supplier can supply many phones. When a supplier supplies a phone, data includes a supply price, the quantity supplied and the date of the supply. If a supplier provides several supplies of a phone, only data for the most recent supply is kept.

  • Details of a purchase by a customer shall include the date of purchase and a contract number (if applicable). A customer can purchase several phones in a single day and the details of each purchase must be kept.

Functional Dependencies of the Business Rules

The first step in creating a relational database is to obtain the functional dependencies from the business rules. A functional dependency is a logical relationship between two sets of attributes X={X1, X2, ..., Xn} and Y={Y1, Y2, ..., Yn}, and must satisfy the criteria X → Y, where the arrow notation is read in plain english as determine. In other words, all attributes together in X determine any attribute in Y, and any attribute in Y must be related to those in X.

The following are the set of minimal basis functional dependencies drawn from the business rules:

employee ID → employee name, position

customer ID → customer name

supplier name → phone number

product code → make, model, retail price, stock quantity

purchase number, contract number → contract term, contract price

supplier name, product code → supply price, supply quantity, supply date

purchase number → customer ID, product code, employee ID, purchase date

Normalisation

The process of normalisation in database design removes data redundancy by measuring the goodness of a relational schema with respect to the normal forms. An important concept in normalisation is the key of a relation. For a functional dependency X → Y, a key is the set of attributes in X that completely determine the remaining attributes in Y of a relation.

The following points describe the levels of normal forms, where higher normal forms remove data redundancy more deeply.

  • First Normal Form (1NF): Attributes of a relation are atomic values and do not contain sets.

  • Second Normal Form (2NF): Non-key attributes of a relation must be fully functionally dependent on a key.

  • Third Normal Form (3NF): A functional dependency FD X → Y of a relation must have X as a candidate key or Y as part of a (possibly different) candidate key

  • Boyce-Codd Normal Form (BCNF): A functional dependency FD X → Y where X is a key of the relation.

The standard for this database is either 3NF or BCNF. The keys of the relations inferred from the minimal set of functional dependencies above are {employee ID}, {customer ID}, {supplier name}, {product code}, {purchase number, contract number}, {supplier name, product code}, {purchase number}. With respect to the definitions above, the relations are all in BCNF.

Modelling the Business Rules

Functional dependencies and normalisation were used to find the non-redundant relationships between the attributes and determine the appropriate keys. The next step is to find the cardinalities of the relations. These are the zero-many (0..*), one-many (1..*)and one-one (1..1) relationships that will later determine the constraints of the relational model. An entity-relationship diagram in UML format has been created below for this purpose. Note the Contract class is a weak-entity set and the solid diamonds represent associations (1..1 relationships) between classes.

Figure 1: An ER diagram of the business model in UML format. Created at Lucidchart.

ER Model to Relational Database Schema Mapping

The ER diagram above is a high-level design of the database. To obtain a low-level design of the database, the diagram is mapped into a relational database schema, where underlined attributes represent primary keys, and attributes with asterisks represent foreign keys.

Supplier(supplierName, contactPhone)

Supply(supplierName*, productCode*, supplyPrice, supplyQuantity, supplyDate)

Phones(productCode, make, model, stockQuantity, retailPrice)

Customer(custID, custName)

Purchase(purchaseNo, purchaseDate, custID*, productCode*, empID*)

Contract(contractNo, purchaseNo*, contractTerm, contractPrice)

Employee(empID, empName, position)

Creating the Relational Database in Oracle and Inserting a Sample Data Set

Database tables have been created from the relational schema in Oracle SQL Developer with SQL CREATE TABLE and INSERT statements. Sample data has been created and inserted into these tables to allow for realistic testing later. The method used to create the sample data is briefly described here.

Below is an extract of the relational database SQL script, with a few INSERT statements included to show method. The complete script can be viewed here.

SQL> CREATE TABLE Supplier(
  2      supplierName varchar(30),
  3      contactPhone char(12),
  4      PRIMARY KEY(supplierName)
  5      );

Table created.

SQL> insert into Supplier (supplierName, contactPhone) values ('Hansen and Sons', '428-147-4213');

1 row created.

SQL> insert into Supplier (supplierName, contactPhone) values ('Dicki-Thompson', '871-171-0605');

1 row created.
SQL> CREATE TABLE Phones(
  2      productCode char(11),
  3      make varchar(17),
  4      model varchar(13),
  5      stockQuantity int,
  6      retailPrice int,
  7      PRIMARY KEY(productCode)
  8      );

Table created.

SQL> insert into Phones (productCode, make, model, stockQuantity, retailPrice) values ('675-971-898', 'Apple', 'pIJ-91679356G', '1', 741);

1 row created.

SQL> insert into Phones (productCode, make, model, stockQuantity, retailPrice) values ('105-878-560', 'Apple', 'CRL-52853444Q', '3', 937);

1 row created.
SQL> CREATE TABLE Customer(
  2      custID varchar(4),
  3      custName varchar(25),
  4      PRIMARY KEY(custID)
  5      );

Table created.

SQL> insert into Customer (custID, custName) values (1, 'Eydie Quixley');

1 row created.

SQL> insert into Customer (custID, custName) values (2, 'Isa Arunowicz');

1 row created.
SQL> CREATE TABLE Employee(
  2      empID varchar(2),
  3      empName varchar(16),
  4      position varchar(17),
  5      PRIMARY KEY(empID)
  6      );

Table created.

SQL> insert into Employee (empID, empName, position) values (1, 'Judith Mumbey', 'Director');

1 row created.

SQL> insert into Employee (empID, empName, position) values (2, 'Moishe Forsey', 'Manager');

1 row created.
SQL> CREATE TABLE Purchase(
  2      purchaseNo char(12),
  3      purchaseDate date,
  4      custID varchar(4),
  5      productCode char(11),
  6      empID varchar(2),
  7      PRIMARY KEY(purchaseNo),
  8      FOREIGN KEY(custID) REFERENCES Customer(custID),
  9      FOREIGN KEY(productCode) REFERENCES Phones(productCode),
 10      FOREIGN KEY(empID) REFERENCES Employee(empID)
 11      );

Table created.

SQL> insert into Purchase (purchaseNo, purchaseDate, custID, productCode, empID) values ('P634-340-525', '20-Sep-2019', 425, '334-405-233', 4);

1 row created.

SQL> insert into Purchase (purchaseNo, purchaseDate, custID, productCode, empID) values ('P672-549-119', '01-Mar-2020', 258, '852-219-856', 5);

1 row created.
SQL> CREATE TABLE Supply(
  2      supplierName varchar(30),
  3      productCode char(11),
  4      supplyPrice int,
  5      supplyQuantity int,
  6      supplyDate date,
  7      PRIMARY KEY(supplierName, productCode),
  8      FOREIGN KEY(supplierName) REFERENCES Supplier(supplierName),
  9      FOREIGN KEY(productCode) REFERENCES Phones(productCode)
 10      );

Table created.

SQL> insert into Supply (supplierName, productCode, supplyPrice, supplyQuantity, supplyDate) values ('Anderson and Sons', '675-971-898', 365, '0', '02-Sep-20');

1 row created.

SQL> insert into Supply (supplierName, productCode, supplyPrice, supplyQuantity, supplyDate) values ('Douglas LLC', '105-878-560', 394, '8', '15-Sep-20');

1 row created.
SQL> CREATE TABLE Contract(
  2      contractNo char(12),
  3      purchaseNo char(12),
  4      contractTerm int,
  5      contractPrice int,
  6      PRIMARY KEY(contractNo, purchaseNo),
  7      FOREIGN KEY(purchaseNo) REFERENCES Purchase(purchaseNo)
  8      );

Table created.

SQL> insert into Contract (contractNo, purchaseNo, contractTerm, contractPrice) values ('C908-706-335', 'P497-820-348', 12, 1020);

1 row created.

SQL> insert into Contract (contractNo, purchaseNo, contractTerm, contractPrice) values ('C039-465-806', 'P903-920-423', 36, 2196);

1 row created.

Testing the Relational Database in Oracle with SQL Queries

A database must be able to handle SQL queries created by an employee based on questions derived from the business rules. Examples include:

  • Which employee had the most overall sales?
  • Which customer purchased the most products in a given period?
  • How many sales were made on a given date?
  • How many phone contracts were sold as purchases on a given date?
  • What is the most popular selling make and model of phone?
  • What is the cost difference between retailPrice and average contractPrice for a given model of phone?
  • What is the cost difference between supplyPrice and retailPrice for a given model of phone?
  • Which supplier last supplied a particular model of phone?
  • How many products were received on a given date?

The following SQL queries demonstrate the good design of the database to handle such questions.

-- Which employee had the most overall sales?
SQL> SELECT
  2      e.empid,
  3      empname,
  4      COUNT(e.empid) AS "SALES COUNT"
  5  FROM
  6           employee e
  7      JOIN purchase p ON e.empid = p.empid
  8  GROUP BY
  9      e.empid,
 10      empname
 11  HAVING
 12      COUNT(e.empid) >= ALL (
 13          SELECT
 14              COUNT(e.empid)
 15          FROM
 16                   employee e
 17              JOIN purchase p ON e.empid = p.empid
 18          GROUP BY
 19              e.empid
 20      );

EMPID EMPNAME          SALES COUNT
----- ---------------- -----------
6     Kleon Dewey              107                                              
-- Which customer purchased the most products in November 2020?
SQL> SELECT
  2      c.custid,
  3      c.custname,
  4      COUNT(c.custid) AS "PURCHASES"
  5  FROM
  6           customer c
  7      JOIN purchase p ON c.custid = p.custid
  8  WHERE
  9      p.purchasedate BETWEEN '01/NOV/20' AND '30/NOV/20'
 10  GROUP BY
 11      c.custid,
 12      c.custname
 13  HAVING
 14      COUNT(c.custid) >= ALL (
 15          SELECT
 16              COUNT(c.custid)
 17          FROM
 18                   customer c
 19              JOIN purchase p ON c.custid = p.custid
 20          WHERE
 21              p.purchasedate BETWEEN '01/NOV/20' AND '30/NOV/20'
 22          GROUP BY
 23              c.custid
 24      );

CUSTID CUSTNAME                   PURCHASES
------ ------------------------- ----------
696    Fanya Attard                       2
-- How many sales were made on 07/DEC/20?
SQL> SELECT
  2      SUM(COUNT(purchaseno)) AS "TOTAL SALES"
  3  FROM
  4      purchase
  5  WHERE
  6      purchasedate = '07/DEC/20'
  7  GROUP BY
  8      purchaseno;

TOTAL SALES
-----------
          3
-- How many phone contracts were sold as purchases on 07/DEC/20?
SQL> SELECT
  2      SUM(COUNT(contractno)) AS "CONTRACTS SOLD"
  3  FROM
  4      contract  c,
  5      purchase  p
  6  WHERE
  7          c.purchaseno = p.purchaseno
  8      AND p.purchasedate = '07/DEC/20'
  9  GROUP BY
 10      contractno;

CONTRACTS SOLD
--------------
             1
-- What is the most popular selling make and model of phone?
SQL> SELECT
  2      make,
  3      model,
  4      COUNT(model) AS "SOLD"
  5  FROM
  6           phones ph
  7      JOIN purchase pu ON pu.productcode = ph.productcode
  8  GROUP BY
  9      make,
 10      model
 11  HAVING
 12      COUNT(model) >= ALL (
 13          SELECT
 14              COUNT(model)
 15          FROM
 16                   phones ph
 17              JOIN purchase pu ON pu.productcode = ph.productcode
 18          GROUP BY
 19              make,
 20              model
 21      );

MAKE              MODEL               SOLD
----------------- ------------- ----------
Apple             irj-11736694U         25
-- What is the cost difference between retailPrice and average contractPrice for the Apple model irj-11736694U?
SQL> SELECT DISTINCT
  2      make,
  3      model,
  4      contractterm,
  5      retailprice,
  6      round(AVG(contractprice))                      AS "AVERAGE CONTRACTPRICE",
  7      round(AVG(contractprice)) - retailprice        AS "COST DIFFERENCE"
  8  FROM
  9      phones    ph,
 10      purchase  pu,
 11      contract  co
 12  WHERE
 13          pu.productcode = ph.productcode
 14      AND pu.purchaseno = co.purchaseno
 15      AND model = 'irj-11736694U'
 16  GROUP BY
 17      make,
 18      model,
 19      contractterm,
 20      retailprice
 21  ORDER BY
 22      contractterm;

MAKE              MODEL         CONTRACTTERM RETAILPRICE AVERAGE CONTRACTPRICE COST DIFFERENCE
----------------- ------------- ------------ ----------- --------------------- ---------------
Apple             irj-11736694U           12        1131                  1196              65
Apple             irj-11736694U           24        1131                  1752             621
Apple             irj-11736694U           36        1131                  2079             948
-- What is the cost difference between supplyPrice and retailPrice for the Apple model irj-11736694U?
SQL> SELECT
  2      make,
  3      model,
  4      retailprice,
  5      supplyprice,
  6      ( retailprice - supplyprice ) AS "COST DIFFERENCE"
  7  FROM
  8           phones p
  9      JOIN supply s ON p.productcode = s.productcode
 10  WHERE
 11      model = 'irj-11736694U';

MAKE              MODEL         RETAILPRICE SUPPLYPRICE COST DIFFERENCE
----------------- ------------- ----------- ----------- ---------------
Apple             irj-11736694U        1131         314             817
-- Which supplier last supplied an Apple model phone?
SQL> SELECT
  2      suppliername,
  3      make,
  4      MAX(supplydate) AS "SUPPLYDATE"
  5  FROM
  6      supply  s,
  7      phones  p
  8  WHERE
  9          s.productcode = p.productcode
 10      AND make = 'Apple'
 11  GROUP BY
 12      suppliername,
 13      make
 14  HAVING
 15      MAX(supplydate) >= ALL (
 16          SELECT
 17              MAX(supplydate)
 18          FROM
 19              supply  s,
 20              phones  p
 21          WHERE
 22                  s.productcode = p.productcode
 23              AND make = 'Apple'
 24      );

SUPPLIERNAME                   MAKE  SUPPLYDATE
------------------------------ ----- ----------
Larson Inc                     Apple  13-DEC-20
-- How many products were received on 01/DEC/20?
SQL> SELECT
  2      supplydate,
  3      SUM(supplyquantity) AS "PRODUCTS RECEIVED"
  4  FROM
  5      supply
  6  WHERE
  7      supplydate = '15/SEP/20'
  8  GROUP BY
  9      supplydate;

SUPPLYDATE PRODUCTS RECEIVED
---------- -----------------
15-SEP-20                 21