Search notes:

Data

Definition of data

ISO 14721 defines data as
A reinterpretable representation of information in a formalized manner suitable for communication, interpretation, or processing. Examples of data include a sequence of bits, a table of numbers, the characters on a page, the recording of sounds made by a person speaking, or a moon rock specimen.

Two types of data

Two (three) types of data (a)
Two types of data (b) (See also level of measurement)

Structured and unstructured data

Data might also be divided into
Because of the complexity of unstructured data, Data profiling tools usually focus on structured and semi-structured data.

Challenges with increasing data volumes

The ever incresing volume of collected data causes challanges such as

TODO

TXR for data munging.
Visualize data with statistical graphics.
In order to get a value from data, it needs to be moved from one location to another.
The four W:
Regulations that require data hygiene, such as
DSVGO = Datenschutz-Grundverordnung, rules for processing data related to persons in the EU. Compare with the the CCPA, Califrona Consumer Privacy Act in the USA.

Storage

DAS: Direct-attached storage, connected through a HBA (Host Bus Adaptor)
NAS: Network-attached storage. Access files (rather than blocks)
Protocols: NFS, SMB, proprietary
SAN: Access blocks (rather than files)
Protocols: SCSI, iSCSI, Fibre Channel, Infiniband

See also

Data science: find knowledge and insights from data.
Data visualization
Data cleaning
Data warehouse (DWH)
Data profiling
open and closed data.
Data governance
Data life cycle
Data mart
metadata
Prefixes for Data Units (such as kilo, mega, giga etc.)
OLAP
Data preparation
Data wrangling
Enterprise Data Integration
Data modeling
geodata
Digital transformation
Data-mining applies statistics and pattern recognizion to discover knowledge from data.
Data migration
Linked data
Data quality
Data exchange
Data lake
Test data
OData is a protocol for creating and consuming data.
Data structures
binary data
Data amount
data sets
data corruption
DBMS - Database Management System
table
In SQL, the special null value is used for absent data.
The Wikidata entity Q42848.

Index