Data Storage For Apps - Cookies, Local Storage, File Storage, Databases, APIs, and More
Various mechanisms for storing data in front- and back-ends.
- Overview
- Cookies
- Local Storage
- IndexedDB
- File Storage
- Relational Databases
- MongoDB
- Mongoose
- Conclusions
Overview
Concept
There are a variety of mechanisms apps can use to store data. These can be generally broken-down into two categories:
- Client-side
- Server-side
Client-server paradigm
A reminder of the “bread-and-butter” of the Internet: clients and servers:
Client-server paradigm
… and the technologies that often power each:
Client-server paradigm
… often with a database attached to the server.
Client-server paradigm
… or so-called serverless architectures.
Persistence within a single client session
HTTP is a stateless protocol. This means that each HTTP request is ignorant of any data produced by requests before it - each request is made from a blank slate.
-
However, app developers usually want dynamic content to be displayed in the client based on a user’s prior interactions with the server.
-
For example, a “logged in” user might see one set of content while a logged-out user might seee something totally different.
-
The client-side storage mechanisms are generally used to hold data that is passed to the server along with every request to “remind” the server of the results of prior interactions.
-
This way apps can maintain state, even when using a stateless protocol such as HTTP.
Offline data
A sophisticated app can sometimes be used (to a limited extent) even when the client is disconnected from the Internet.
-
Client-side data storage mechanisms can be used to store any content necessary for offline use.
-
Once the device reconnects to the Internet, this data can be uploaded to a server for more persistent storage, if desired.
Persistence across clients
In addition to maintaining state within a single client session, it is often desireable to have data persist across more than one client session.
-
For example, a news web site would like all visitors to see the same articles each day.
-
A single post on a social media app is often viewable by many different users.
-
A user of a social network usually expects to see the same content regardless of whether they connect via a mobile device or a desktop.
-
These scenarios typically require data to be stored on the server-side so that the same data can be sent to a user, regardless of which client is used.
Client-side
- Cookies
- Local storage
- IndexedDB
- Serverless databases
Server-side
- File storage
- Relational databases
- NoSQL databases
- Serverless databases
Cookies
Concept
Cookies are simple text key/value pairs stored in a client web browser. With every request, these key value pairs are sent in the HTTP headers to the server.
-
The lifecycle of a cookie begins when a web browser makes an HTTP request to a web server for a resource.
-
In the response, the server can instruct the client to store a cookie by sending a
Set-Cookie
HTTP header in its response.
Set-Cookie: userid=22
- With every subsequent request to that same server, the web browser will automatically include the cookie data in its HTTP request headers.
Cookie: userid=22
- Presumably, the server uses this data to customize its subsequent responses to this client or track the usage of this particular client.
Limitations
Cookies are intended to be read by servers and have a few limitations that make their use limited.
-
Cookies can store text only, storing structured data is challenging.
-
Cookies are limited to 4 kilobytes per server domain.
-
Cookie data will always be sent to the relevant server with every request.
-
Cookies expire at a given date specified by the server.
-
As with all client-side technologies, it’s easy for a user to fiddle with cookie values.
Local Storage
Concept
Local storage (a.k.a. Web Storage) is another client-side data storage mechanism that serves a different use case to cookies.
-
Intended to be read by the client. Local storage is never automatically sent to the server.
-
Only accessed via client-side Javascript code.
-
5 megabytes or more of data storage (depending on the browser) per domain.
-
No expiration date - local storage can only be cleared by Javascript or by a user wiping the browser’s data.
-
Supports only text data, like cookies, but easy to serialize and deserealize object data with
JSON.stringify(myObj, null, 2)
andJSON.parse(myString)
, respectively. This is the best practice for storing data in local storage.
IndexedDB
Concept
IndexedDB is an alternative and successor to Local Storage for web-browser-based data storage.
Comparison to Local Storage
Compared to Local Storage, IndexedDB offers:
-
more storage space
-
support for other data types, in addition to strings
-
faster retrieval of data for large data sets, due to indexing
-
asynchronous operations, so as not to block app performance
Which to use?
In short, use IndexDB when…
-
you need to store a lot of data
-
you need to store more than just key->value pairs
For most other cases, Local Storage is probably simpler.
Serverless Databases
Concept
A new(ish) breed of serverless databases provide storage facilities for both client- and server-side needs.
-
e.g. Google Firebase, Amazon Lambda, Aurora, and DynamoDB, Azure Data Lake, and more.
-
simple to set up - no DB admin job training necessary
-
scalable - as cloud services, they operate aas virtual environments across many data centers
-
typically offer API clients in a variety of popular languages -
Javascript
,Python
,C#
etc. -
handle structured data, like objects as well as simple unstructured data
-
increasingly do more than data storage, including tasks typically requiring server logic, such as user authentication, machine learning, etc.
File Storage
Concept
Sometimes simple file system storage is all that is needed by an app, for example for user-generated images, audio tracks, videos, etc.
-
Small-to-medium apps can store files on the web server itself. While client-side technologies are ‘sandboxed’ and can’t access the local file system, server-side code has no such constraints.
-
More demanding file storage needs can be met using a separate remote file storage server, such as Amazon S3 or Elastic File System or Azure Files
-
It is usually not worthwhile to store user-generated assets such as images, audio, or video files in a database. But the path to those files can be stored in the database.
Example: multer
The multer Express.js middleware makes handling uploaded files simple.
-
Multer can be configured to store files directly on the web server, or integrate and send them to remote storage services, like Amazon AWS or Azure.
-
For example, to set
multer
to store uploaded files to the web server file system, install it (npm install --save multer
), import it (const multer = require('multer')
), and then:
var storage = multer.diskStorage({
destination: function (req, file, cb) {
// store files into a directory named 'uploads'
cb(null, "/uploads")
},
filename: function (req, file, cb) {
// rename the files to include the current time and date
cb(null, file.fieldname + "-" + Date.now())
},
})
var upload = multer({ storage: storage })
Example: multer (continued)
Once set up, any route can pass the request object through the multer
middleware prior to handling the response.
multer
will upload the files as configured and make the metadata about them available in the request object.
// assuming multiple files labeled as 'files' were included in the HTTP request
router.post('/upload-route-example', upload.array('files'), (req, res) => {
console.log( JSON.stringify(req.files, null, 2) )
if (req.files.length) res.json( { success: true, message: 'thanks!' })
else res.status(500).send({ success: false, message: 'Oops... something went wrong!' })
}
Example: multer (continued)
Now, the client, for example a web browser, can allow a user to upload a file and send it to the server to be handled by this route.
-
The web browser must display a
<form>
element to the user, with an<input type="file" name="files">
child element - this shows the user a file picker in the interface. -
Once the user submits the form, the client Javascript code can use the browser’s native
FormData
object to package up the form data, including any files picked by the user. -
The client-side Javascript can then make an asynchronous HTTP POST request to the server, sending it the data in the
FormData
object. -
See some simple example code using
axios
to make the request to the server.
Relational Databases
Concept
Relational databases stored data in a row/column-type format. They are essentially glorified spreadsheets.
- For example, data on animals might be stored as rows and columns in a table named
animals
, where each row represents a single record of a single animal.
id | common_name | big_or_not | image_path |
---|---|---|---|
1 | giraffe | big | uploads/giraffe.jpg |
2 | squirrel | not big | uploads/squirrel.png |
- Relational databases generally require the SQL language to send them instructions, for example to retrieve data.
SELECT common_name, image_path FROM animals WHERE id=2
CRUD
Databases, relational or not, are typically used to do one of four tasks, affectionately referred to with the acronym, CRUD.
-
Create - create new records (i.e. rows) in the database. Typically done using the SQL
INSERT
command. -
Read - retrieve records from the database. Typically done using the SQL
SELECT
command. -
Update - modify existing records in the database. Typically done using the SQL
UPDATE
command. -
Delete - delete existing records from the database. Typically done using the SQL
DELETE
command. -
See some examples of each.
Joins
The bane of every relational database developer’s existence is the concept of joins.
-
Related records from more than one table are joined together.
-
For example, if I want to keep track of the animals in my personal, I might have another table named
inventory
that stored each of my animals’ names, and a reference to the relevant record in theanimals
table.
id | animal_id | name |
---|---|---|
1 | 1 | Bob |
2 | 1 | Sheila |
- To get the full data of the animals in my personal zoo in the
inventory
table combined with the abstract animal data in theanimals
table…
SELECT * FROM inventory INNER JOIN animals ON inventory.animal_id = animals.id
MongoDB
Concept
MongoDB is one of many increasingly-popular NoSQL databases that do not use SQL and do not store data in row/column format.
-
MongoDB is a document-oriented database, meaning records are stored as JSON objects called ‘documents’… not ‘rows’ as in relational databases.
-
Documents are bundled up into groups called collecetions… not ‘tables’ as in relational databases.
-
Documents generally hold all the information necessary to store data about a single record…. joins between multiple documents are possible, but frowned upon unless absolutely necessary.
Example document
For example, a document that represents an animal in my personal zoo might look like:
{
id: 1,
name: 'Bob',
type: {
commonName: 'giraffe',
bigOrNot: 'big',
imagePath: 'uploads/giraffe.jpg'
}
}
- Notice that the general data about the animal ‘giraffe’ is bundled up into the specific animal document about ‘Bob’…. this is the conventional MongoDB way.
MongoDB Atlas
MongoDB offers a cloud-based database server named Atlas with a free tier for trying it out…
-
This can be a useful first database server.
-
It is also possible to download and install MongoDB locally on development machines and run it on a different port than the Express.js web server.
Mongoose
When connecting an Express.js web server to a MongoDB server, it is wise to use mongoose, a 3rd-party module.
-
Mongoose abstracts away the details of MongDB, and provides a simple Javascript interface.
-
See example code.
Credentials
Storing database credentials in version control is a big mistake!
- use the dotenv module or Node’s native capabilities to load credentials into code from bash environmental variables stored in a hidden non-version-controlled file named
.env
.
Conclusions
You have seen a brief overview of various data storage mechanisms.
-
This example repository includes examples of using both Cookies and Local Storage on both the client- and server-sides.
-
Thank you. Bye.