Data redundancy is a data organization issue that allows the unnecessary duplication of data within your Microsoft Access database. A change or modification, to redundant data, requires that you make changes to multiple fields of a database. While this is the expected behaviour for flat file database designs and spreadsheets, it defeats the purpose of relational database designs. The data relationships, inherent in a relational database, should allow you to maintain a single data field, at one location, and make the database’s relational model responsible to port any changes, to that data field, across the database. Redundant data wastes valuable space and creates troubling database maintenance problems.
To eliminate redundant data from your Microsoft Access database, you must take special care to organize the data in your data tables. Normalization is a method of organizing your data to prevent redundancy. Normalization involves establishing and maintaining the integrity of your data tables as well as eliminating inconsistent data dependencies.
Establishing and maintaining integrity requires that you follow the Access prescribed rules to maintain parent-child, table relationships. Eliminating inconsistent, data dependencies involves ensuring that data is housed in the appropriate Access database table. An appropriate table is a table in which the data has some relation to or dependence on the table.
Normalization requires that you adhere to rules, established by the database community, to ensure that data is organized efficiently. These rules are called normal form rules. Normalization may require that you include additional data tables in your Access database. Normal form rules number from one to three, for most applications. The rules are cumulative such that the rules of the 2nd normal form are inclusive of the rules in the 1st normal form. The rules of the 3rd normal form are inclusive of the rules in the 1st and 2nd normal forms, etc.
The rules are defined as follows:
1st normal form: Avoid storing similar data in multiple table fields.
- Eliminate repeating groups in individual tables.
- Create a separate table for each set of related data.
- Identify each set of related data with a primary key.
2nd normal form: Records should be dependent, only, upon a table’s primary key(s)
- Create separate tables for sets of values that apply to multiple records.
- Relate these tables with a foreign key.
3rd normal form: Record fields should be part of the record’s key
- Eliminate fields that do not depend on the key.
The 3rd normal form suggests that fields, that apply to more than one record, should be placed in a separate table. However, this may not be practical solution, particularly for small databases. The inclusion of additional tables may degrade database performance by opening more files than memory space allows. To overcome this limitation, of the third normal form, you may want to apply the third normal form only to data that is expected to change frequently.
Two, more advanced, normal forms have been established with application that is more complex. The Failure to conform to the established rules of these normal forms results in a less perfectly designed database, but the functionality of your database is not affected by avoiding them.
The advanced normal forms are as follows:
4th normal form: Boyce Codd Normal Form (BCNF)
- Eliminate relations with multi-valued dependencies.
5th normal form:
- Create relations that cannot be further decomposed.
In an Access database we create tables to hold data. A table is made up of columns, sometimes referred to as attributes or fields. Each of these columns will have a particular data type that defines the data it can hold. For example if we are entering numeric information such as prices, the best data type to use here would be number or currency. For a first name and surname it is better to use a text data type. An alternative to a text data type is what is known as memo. The Microsoft access memo data type will be used to hold text, but it allows a lot more text than the general text data type.
When should we use memo? This is best used in situations where we have to enter a description or even notes. We may wish to store a lot of information about a customer or about a particular transaction.
Generally when using a form we would increase the size of the memo text box and set it's key behavior property to 'New line in field'. This allows us to hit the return key so that the data is not bunched together as one long paragraph.
Keep in mind that you cannot sort or index a Microsoft access memo field. They also have severe limitations when used in queries. Really though you would tend to avoid using them in queries so this is not such a big issue.
In Microsoft Access 2007 we can apply rich text formatting to the memo field both is datasheet and form views. In order to do this we have to set the text format property of the memo field in the table to become 'Rich Text'.
It is always important that the correct data types are used throughout the database. Sometimes this is discovered through trial and error, although good planning can also go a long way to improving performance of the database.
Text
Use for text or combinations of text and numbers, such as addresses, or for numbers that do not require calculations, such as phone numbers, part numbers, or postal codes.
Stores up to 255 characters. The Field Size property controls the maximum number of characters that can be entered.
Memo
Use for lengthy text and numbers, such as notes or descriptions.
Stores up to 65,536 characters.
C. What are DBMS Forms?