Data structures

What are data structures in computer science?
Data structures provide organization (relationships), management (functions or operations), and a format for data. Each data structure has its own efficiency characteristics and optimizations.

A data structure is a collection of data values, the relationships among them, and the functions or operations that can be applied to the data, i.e., it is an algebraic structure about data.
— Wikipedia

array
- set
- stack
queue
linked data structure
- linked list
- singly-linked list
- doubly-linked list
- skip list
hash table
tree
Binary search tree: BST
heap / Priority Queue / Binary Heap
balanced search trees (general concept, not details)
traversals: preorder, inorder, postorder, BFS, DFS
graph
directed
undirected
adjacency matrix
adjacency list
traversals: BFS, DFS
Tries

Extra topics:

Balanced search trees
- AVL trees
- Splay trees
- Red/black trees
- 2-3 search trees
- 2-3-4 Trees (aka 2-4 trees)
- N-ary (K-ary, M-ary) trees
- B-Trees
k-D Trees

Data types

Programming languages

Primitive types

boolean
character
floating-point arithmetic
Fixed-point representation of the rationals
Integer, a direct representation of either the integers or the non-negative integers
Reference, sometimes erroneously referred to as a pointer or handle, is a value that refers to another value, possibly including itself
Symbol, a unique identifier
Enumerated type, a set of symbols
Complex, representation of complex numbers

Linear data structures

Trees

Hash-based structures

Graphs

Other

Data structures examples

An array is a number of elements in a specific order typically all the same type (depending on the language). Elements are accessed using an integer index (usually zero-indexed) to specify which element is required. Typical implementations allocate contiguous memory words for the elements of arrays. Arrays may be fixed-length or resizable.
A linked list (or list) is a linear collection of data elements of any type, called nodes, where each node holds a value and points to the next node in the list. The principal advantage of a linked list over an array is that, once the position is known, values can be inserted and removed without relocating the rest of the list. Certain other operations, such as random access to a particular element, are however slower on lists than on arrays.
A record (also called tuple or struct) is an aggregate data structure. A record is a value that contains other values, typically in fixed number and sequence and typically indexed by names. The elements of records are usually called fields or members. In the context of object-oriented programming, records are known as plain old data structures to distinguish them from objects.
A hash table, also known as a hash map or dictionary, is a data structure that provides fast retrieval of values based on keys. It uses a hash function to map keys to indexes in an array, allowing constant-time access in the average case. Hash tables are commonly used in dictionaries, caches, and database indexing. However, hash collisions can occur, and they degrade performance. Techniques like chaining and open addressing are used to handle collisions.
A graph is a collection of vertices (nodes) connected by edges, representing relationships between entities. Graphs can be used to model social networks, computer networks, and transportation networks, among other things. Graphs can be directed or undirected, and they can have cycles or be acyclic. Graph traversal algorithms include breadth-first search and depth-first search.
Stacks and queues are abstract data types that can be implemented using arrays or linked lists. A stack has two primary operations, push (adds an element to the top of the stack) and pop (removes the topmost element from the stack), which follow the Last In, First Out (LIFO) principle. A queue has two main operations, enqueue (adds an element to the rear of the queue) and dequeue (removes an element from the front of the queue), which follow the First In, First Out (FIFO) principle.
Trees represent a hierarchical organization of elements. A tree consists of nodes connected by edges, with one node being the root and all other nodes forming subtrees. Trees are widely used in algorithms and data storage. Binary trees (for example binary search trees and binary heaps), AVL trees, and B-trees are some popular types of trees. They enable efficient searching, sorting, and hierarchical representation of data.
A trie, also known as a prefix tree, is a specialized tree data structure used for the efficient retrieval of strings. Each edge represents a character, so the path from the root to a node spells a prefix of the stored strings. Tries are particularly useful in text processing scenarios like autocomplete, spell-checking, and dictionary implementations. They enable fast searching and prefix-based operations on strings.
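The trie description above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the names `TrieNode`, `Trie`, `contains`, and `with_prefix` are hypothetical, and children are kept in a plain dict keyed by character.

```python
class TrieNode:
    def __init__(self):
        self.children = {}     # maps a character (edge label) to a child node
        self.is_word = False   # True if a stored string ends at this node


class Trie:
    def __init__(self):
        self.root = TrieNode()

    def insert(self, word):
        """Walk down from the root, creating missing nodes along the way."""
        node = self.root
        for ch in word:
            if ch not in node.children:
                node.children[ch] = TrieNode()
            node = node.children[ch]
        node.is_word = True

    def _find(self, prefix):
        """Return the node reached by following prefix, or None."""
        node = self.root
        for ch in prefix:
            node = node.children.get(ch)
            if node is None:
                return None
        return node

    def contains(self, word):
        node = self._find(word)
        return node is not None and node.is_word

    def with_prefix(self, prefix):
        """Return all stored words that start with prefix, sorted."""
        start = self._find(prefix)
        if start is None:
            return []
        results = []
        pending = [(start, prefix)]   # explicit stack for a depth-first walk
        while pending:
            node, text = pending.pop()
            if node.is_word:
                results.append(text)
            for ch, child in node.children.items():
                pending.append((child, text + ch))
        return sorted(results)


trie = Trie()
for word in ["car", "card", "care", "cat", "dog"]:
    trie.insert(word)

print(trie.contains("car"))      # True
print(trie.contains("ca"))       # False: "ca" is only a prefix
print(trie.with_prefix("car"))   # ['car', 'card', 'care']
```

The `with_prefix` method is the kind of operation behind autocomplete: the cost of reaching the starting node depends on the length of the prefix, not on how many words are stored.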

A stack follows the LIFO principle (data manipulation and organization).

A queue follows the FIFO principle (data manipulation and organization).
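The two orderings are easy to see side by side. The sketch below assumes a Python list as the stack and `collections.deque` as the queue; the helper names `stack_order` and `queue_order` are made up for this illustration.

```python
from collections import deque


def stack_order(items):
    """Push every item, then pop until empty; return the order of removal."""
    stack = []
    for item in items:
        stack.append(item)            # push onto the top
    removed = []
    while stack:
        removed.append(stack.pop())   # pop from the top
    return removed


def queue_order(items):
    """Enqueue every item, then dequeue until empty; return the order of removal."""
    queue = deque()
    for item in items:
        queue.append(item)              # enqueue at the rear
    removed = []
    while queue:
        removed.append(queue.popleft())  # dequeue from the front
    return removed


print(stack_order([1, 2, 3]))  # [3, 2, 1]  last in, first out
print(queue_order([1, 2, 3]))  # [1, 2, 3]  first in, first out
```

A `deque` is used for the queue because removing from the front of a plain list shifts every remaining element, while `popleft` does not.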

Language support

Do low-level and high-level programming languages have built-in support for data structures?
Low-level languages typically have no built-in support for data structures, or only very basic support. High-level languages either provide special syntax or ship standard libraries with data structure implementations.

Some languages allow defining opaque data types, which hide implementation details (OOP languages typically use classes for this purpose).

Many known data structures have concurrent versions which allow multiple computing threads to access a single concrete instance of a data structure simultaneously.

Data structure operations

Which data structure operations (for an array) are important?
Here is a list of the most common operations:

Read: look up the value at a particular index of the array.
Search: check whether a value exists in the array and, if so, return its index.
Insert: add a new value to an additional slot in the array.
Delete: remove a value from the array.
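The four operations can be sketched on a Python list standing in for an array. The function names are hypothetical, and insert and delete are written with explicit shifting (instead of `list.insert` and `list.pop(index)`) only to make the extra work visible.

```python
def read(array, index):
    """Read: one step, the index leads straight to the slot."""
    return array[index]


def search(array, value):
    """Search: check slots one by one; return the index or -1 if absent."""
    for i, item in enumerate(array):
        if item == value:
            return i
    return -1


def insert_at(array, index, value):
    """Insert: open a slot at the end, then shift later elements right."""
    array.append(None)
    for i in range(len(array) - 1, index, -1):
        array[i] = array[i - 1]
    array[index] = value


def delete_at(array, index):
    """Delete: shift later elements left over the gap, then drop the last slot."""
    removed = array[index]
    for i in range(index, len(array) - 1):
        array[i] = array[i + 1]
    array.pop()
    return removed


data = ["apples", "bananas", "dates"]
print(read(data, 1))           # bananas
print(search(data, "dates"))   # 2
insert_at(data, 2, "cucumbers")
print(data)                    # ['apples', 'bananas', 'cucumbers', 'dates']
delete_at(data, 0)
print(data)                    # ['bananas', 'cucumbers', 'dates']
```

Reading costs the same regardless of array size, while search, insert, and delete may touch every element in the worst case.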

Why do data structures matter?
Solid knowledge of data structures lets you write scalable, fast, and efficient software, deepens your software expertise, and makes you happy 🙂.

Analyzing the number of steps an operation takes is at the heart of understanding the performance of data structures and of the programs that use them. Always analyze your application to see which data structure is the better fit.
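As an illustration of counting steps, the sketch below compares two ways of searching the same array of 1000 sorted integers. Binary search is brought in here only as an example of an operation that exploits how the data is organized; the function names are hypothetical and each "step" is one comparison against an array element.

```python
def linear_search_steps(values, target):
    """Count the elements inspected when scanning from the start."""
    steps = 0
    for value in values:
        steps += 1
        if value == target:
            break
    return steps


def binary_search_steps(sorted_values, target):
    """Count the elements inspected when halving a sorted range each time."""
    steps = 0
    low, high = 0, len(sorted_values) - 1
    while low <= high:
        steps += 1
        mid = (low + high) // 2
        if sorted_values[mid] == target:
            break
        if sorted_values[mid] < target:
            low = mid + 1
        else:
            high = mid - 1
    return steps


values = list(range(1000))   # already sorted
print(linear_search_steps(values, 999))  # 1000
print(binary_search_steps(values, 999))  # 10
```

The data is identical in both cases; only the guarantee that it is sorted lets the second function skip most of it.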

Data structures refer to how data is organized. Does this organization matter?
Data organization can have a large impact on performance and efficiency. A program can run fast and efficiently, or fail to run at all (by running out of memory, for example). Depending on how the data is organized, a program can be faster or slower by orders of magnitude.

What is a data type?
A data type is a set of possible values, the operations on them, and their representation as machine types. More fully, a data type (or simply type) is a collection or grouping of data values, usually specified by a set of possible values, a set of allowed operations on these values, and/or a representation of these values as machine types.

Modern programming languages offer ways to work with data without dealing with explicit types directly; you usually work through interfaces or abstract data types.

How do computers store data? Are there differences between data types in their final form?
Data is ultimately just numbers (strings included); in a computer, all data is stored as bit sequences.
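A small Python sketch makes this concrete. The helper `to_bits` is hypothetical; it prints the bytes behind an integer, a one-character string, and a 32-bit float (packed with the standard `struct` module, big-endian).

```python
import struct


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, separated by spaces."""
    return " ".join(f"{byte:08b}" for byte in data)


print(to_bits((65).to_bytes(1, "big")))   # 01000001
print(to_bits("A".encode("utf-8")))       # 01000001
print(to_bits(struct.pack(">f", 1.0)))    # 00111111 10000000 00000000 00000000
```

The integer 65 and the string "A" end up as the same bit sequence; the data type is what tells the program how to interpret those bits.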

What is the most widely used data structure for storing sequences?
An array.

Data structures implementation

Array and record data structures are based on computing the addresses of data items with arithmetic operations. Stack (Array Implementation).

Linked data structures are based on storing addresses of data items within the structure itself. Stack (Linked List Implementation).
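The contrast between the two approaches can be sketched with the same stack built both ways. This is an illustration under assumptions, not the implementation the links above refer to: `ArrayStack` uses a fixed block of slots and an integer index, `LinkedStack` stores a reference to the next node inside each node, and all class names are hypothetical.

```python
class ArrayStack:
    """Stack over a fixed block of slots; the top is found by index arithmetic."""

    def __init__(self, capacity):
        self._slots = [None] * capacity
        self._top = 0                      # index of the next free slot

    def push(self, value):
        if self._top == len(self._slots):
            raise OverflowError("stack is full")
        self._slots[self._top] = value
        self._top += 1

    def pop(self):
        if self._top == 0:
            raise IndexError("pop from empty stack")
        self._top -= 1
        value = self._slots[self._top]
        self._slots[self._top] = None      # clear the vacated slot
        return value


class Node:
    def __init__(self, value, next_node=None):
        self.value = value
        self.next = next_node              # reference to the node below


class LinkedStack:
    """Stack over linked nodes; each node stores where the next one lives."""

    def __init__(self):
        self._head = None

    def push(self, value):
        self._head = Node(value, self._head)

    def pop(self):
        if self._head is None:
            raise IndexError("pop from empty stack")
        node = self._head
        self._head = node.next
        return node.value


for stack in (ArrayStack(capacity=3), LinkedStack()):
    stack.push("a")
    stack.push("b")
    print(stack.pop(), stack.pop())        # b a
```

Both classes expose the same push and pop operations, so code that uses the stack does not need to know which representation sits underneath.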

The implementation of a data structure usually requires writing a set of procedures that create and manipulate instances of that structure. The efficiency of a data structure cannot be analyzed separately from those operations. This observation motivates the theoretical concept of an abstract data type, a data structure that is defined indirectly by the operations that may be performed on it, and the mathematical properties of those operations (including their space and time cost).

The efficiency of a data structure is analyzed together with the procedures that create and manipulate its instances. Which of these operations do you know?
insert, delete, search, etc.

To-do

imperative and functional versions
greedy
dynamic strategies
Fizz buzz - Wikipedia
Data Structure Visualization
Visualising data structures and algorithms through animation
