First, please create an account

Already have a Sophia account?

Introduction to Databases

Author: Sophia

what's covered

You have already been introduced to the first two components of information systems: hardware and software. However, those two components by themselves do not make a computer useful. The missing piece is data! On personal computing devices, data is often stored locally, but in the business world, it’s much more common for important data to be stored in a centralized database management system. In this lesson, you will explore database concepts, including the different kinds of databases that are popular in businesses today, using both structured and unstructured data models. Specifically, this lesson will cover:

1. Data and Information
2. What Is a Database?
- 2a. Structured Databases
- 2b. Unstructured and Semi-Structured Databases
3. Elements of a Database

1. Data and Information

The term data refers to raw facts without context. Data can be quantitative data (numeric values from measurements, counts, or calculations) or qualitative data (descriptive details). For example, the number of cubic feet of trunk space in a 2025 Ford Focus is quantitative; the color of the car is qualitative.

On its own, data has limited value. It becomes useful when given context, transforming it into information. For example, if you read the numbers 15, 23, 14, 85 without explanation, it means little. But if you learn these numbers are the counts of students registered for specific classes, the data conveys meaning and becomes information.

terms to know

Quantitative Data

Numeric data that is the result of a measurement, count, or some other mathematical calculation.

Qualitative Data

Data that is descriptive.

Information

Data that has been organized and given context to make it meaningful.

2. What Is a Database?

Many information systems aim to turn data into information, which can then generate knowledge for better decision making. To do this, the system must place data in context and provide tools for searching, modifying, organizing, and removing it.

A database is designed for this purpose. It is an organized collection of related information, meaning each piece of data is described and connected to other data. All information in a database should be related in some way; unrelated information should be stored in separate databases. For example, a student database should not also store company stock prices.

Common examples of small databases include a phone book, a list of songs with their lengths, or student names with their birthdates. Databases also work behind websites and applications, such as a school’s database of student enrollment and attendance. At the enterprise level, very large databases manage the product inventory and ordering processes for huge online retailers like Amazon.

A database is essentially just a collection of stored data. For a database to be useful, there must be a way to add, remove, edit, and query the data. A database management system (DBMS) is software that stores, organizes, and manages data, providing tools that humans can use for data entry, retrieval, and security. A DBMS can be as small as a simple desktop app running on a single computer or as large as a multi-billion-dollar cloud-based network of interconnected servers and apps.

terms to know

Database

An organized collection of related information.

Database Management System (DBMS)

Software that stores, organizes, and manages data, providing tools that humans can use for data entry, retrieval, and security.

2a. Structured Databases

Databases are often classified according to the amount of structure they have—that is, the amount of consistency in how the data is stored.

A structured database stores data in a fixed, organized format, typically in tables with rows and columns, making it easy to search and manipulate.

Structured databases store data in tables. A table (also called a relation) is a two-dimensional grid for organizing data in fields and records. Each column in a table represents a field (also called an attribute), which is a type of data such as Employee ID, Phone, or Email. Each row in a table represents a record (also called a tuple), which is a set of field data for one instance, such as an individual employee.

watch

Check out the video below that describes elements of a structured database.

Structured databases can either be flat file or relational.

A flat file database consists of a single two-dimensional table with rows and columns, such as a class gradebook stored in a spreadsheet. Flat file databases work well for basic tasks such as storing, editing, printing, or displaying information. They can be created with spreadsheet software or even a plain text editor like Notepad, using commas to separate columns and paragraph breaks to separate rows.

A relational database contains multiple tables, with each table being related (connected) in some way to at least one other table. Most relational database management systems understand a standard set of commands known as Structured Query Language (SQL). Database administrators use SQL to create and manage tables, add and modify data in those tables, and retrieve data based on specific criteria to build useful reports.

terms to know

Structured Database

A database that stores data in a fixed, organized format, typically in tables with rows and columns, making it easy to search and manipulate.

Table

Also called a relation, a two-dimensional grid for organizing data in fields and records.

Field

Also called an attribute, a column in a database table representing an attribute of the data, such as Phone or ZIP code.

Record

Also called a tuple, a row in a database table representing one instance, such as an individual person or object.

Tuple

Also called a record, a row in a database table representing one instance, such as an individual person or object.

Flat File Database

A single two-dimensional table with orderly rows and columns, such as a class gradebook stored in a spreadsheet.

Relational Database

A collection of related database tables and the relationships between them.

Structured Query Language (SQL)

A set of standardized commands for working with a relational database.

2b. Unstructured and Semi-Structured Databases

In today’s complex data management environment, rigidly structured databases cannot meet every data storage need because not all information fits neatly into a consistent field-and-record format.

NoSQL is an umbrella term for database systems that do not follow the traditional relational model of fixed tables, rows, and columns. Many NoSQL databases are designed to handle unstructured or semi-structured data that does not fit neatly into rigid schemas.

An unstructured database holds data that does not follow a predefined format, such as text documents, images, videos, or social media posts. Unstructured databases require specialized tools and techniques for storage and retrieval.

A semi-structured database contains data that does not fit neatly into tables but still has some organizational structure. Such databases often use formats like JSON or XML, which include tags or keys to identify data elements while allowing flexibility in how the data is stored.

There are no rigid rules about how a NoSQL database management system should operate, but they generally break away from the strict table-and-row structure of relational databases to support more flexible ways of storing and retrieving data. For example, a document-oriented NoSQL database like MongoDB stores semi-structured data in flexible JSON-like documents, allowing each record to have different fields. A key-value store like Redis can handle unstructured data by associating arbitrary content with simple keys. Graph databases store data as nodes and relationships rather than in tables.

IN CONTEXT

Many large companies employ multiple database types to achieve their goals. For example, Amazon primarily uses structured databases to manage its product catalog, pricing, and inventory because this information fits well into organized tables with defined fields such as product ID, name, description, price, and stock level. However, Amazon also integrates unstructured and semi-structured databases to store related content like customer reviews (semi-structured text with ratings) and product images (unstructured).

terms to know

NoSQL

An umbrella term for database systems that do not follow the traditional relational model of fixed tables, rows, and columns.

Unstructured Database

A database that holds data that does not follow a predefined format.

Semi-Structured Database

A database that contains data that does not fit neatly into tables but still has some organizational structure.

3. Elements of a Database

A data model is an overall conceptual framework that defines how data is structured, related, and managed in a database. It describes the types of data, their relationships, and the rules for interacting with them. So far in this tutorial, you have learned about several different data models, including relational, document, key-value, and graph.

A schema is the specific implementation of a data model for a particular database. It defines the actual structure in detail, including table names, column names, data types, constraints, and relationships, based on the chosen data model. In other words, the data model is the theory, and the schema is the practice.

EXAMPLE

Suppose a veterinary practice has a database for keeping track of owners, pets, and their medical events. The data model for this database could be described as structured and relational.

The schema for the database consists of the specific tables and fields it contains, along with the relationships between them. The schema might include tables such as Pets, Owners, and Appointments, with relationships that connect each pet to an owner and to an appointment.

When a relational database designer is planning a database, part of the process is to create an entity-relationship diagram (ERD), which is a visual representation of the schema. The ERD shows each table in its own box, with lines drawn between the tables to indicate the relationships between them. The following ERD shows what the veterinary practice’s database might consist of.

terms to know

Data Model

An overall conceptual framework that defines how data is structured, related, and managed in a database.

Schema

The specific implementation of a data model for a particular database, defining the actual structure in detail, including its tables, fields, relationships, and data types.

Entity-Relationship Diagram (ERD)

A blueprint of the tables and fields in a relational database and the relationships between them.

summary

In this lesson, you learned about some important database principles. You began by discussing the difference between data and information, as well as quantitative vs. qualitative data. Next, you learned what a database is and explored some broad types of databases, including structured, unstructured, and semi-structured. The main types of structured databases are flat file (a basic one-table database) and relational (a multi-table database with connections between the tables). Alternatives to structured databases include NoSQL document databases like MongoDB, key-value stores like Redis, and graph databases. Finally, you learned about the elements of a database, including data models and schema. The data model is the broad type of database, like relational or key-value store, and the schema is a specific implementation of the data model type.

Source: This tutorial was authored by Sophia Learning. Please see our Terms of Use.

Terms to Know

Data Model: An overall conceptual framework that defines how data is structured, related, and managed in a database.
Database: An organized collection of related information.
Database Management System (DBMS): Software that stores, organizes, and manages data, providing tools that humans can use for data entry, retrieval, and security.
Entity-Relationship Diagram (ERD): A blueprint of the tables and fields in a relational database and the relationships between them.
Field: Also called an attribute, a column in a database table representing an attribute of the data, such as Phone or ZIP code.
Flat File Database: A single two-dimensional table with orderly rows and columns, such as a class gradebook stored in a spreadsheet.
Information: Data that has been organized and given context to make it meaningful.
NoSQL: An umbrella term for database systems that do not follow the traditional relational model of fixed tables, rows, and columns.
Qualitative Data: Data that is descriptive.
Quantitative Data: Numeric data that is the result of a measurement, count, or some other mathematical calculation.
Record: Also called a tuple, a row in a database table representing one instance, such as an individual person or object.
Relational Database: A collection of related database tables and the relationships between them.
Schema: The specific implementation of a data model for a particular database, defining the actual structure in detail, including its tables, fields, relationships, and data types.
Semi-Structured Database: A database that contains data that does not fit neatly into tables but still has some organizational structure.
Structured Database: A database that stores data in a fixed, organized format, typically in tables with rows and columns, making it easy to search and manipulate.
Structured Query Language (SQL): A set of standardized commands for working with a relational database.
Table: Also called a relation, a two-dimensional grid for organizing data in fields and records.
Tuple: Also called a record, a row in a database table representing one instance, such as an individual person or object.
Unstructured Database: A database that holds data that does not follow a predefined format.

First, please create an account

Introduction to Databases

Table of Contents

1. Data and Information

2. What Is a Database?

2a. Structured Databases

2b. Unstructured and Semi-Structured Databases

3. Elements of a Database