Skip to content

Latest commit

 

History

History
936 lines (687 loc) · 29 KB

README.md

File metadata and controls

936 lines (687 loc) · 29 KB

go-collections

Go Report Card Build Status Version License GoDoc

go-collections is a Go package that provides implementations of common data structures, including a double-ended queue (Deque), various stack implementations, a linked list, a queue, a trie, a priority queue, a binary search tree, a skip list, and a graph. This package offers a simple and efficient way to use these structures in Go, with support for generic types.

Table of Contents

  1. Installation
  2. Usage
  3. Data Structures
  4. License

You can install the package using the Go module system:

go get github.com/idsulik/go-collections/v2

Here is a brief example of how to use the Deque:

package main

import (
  "fmt"
  "github.com/idsulik/go-collections/v2/deque"
)

func main() {
  d := deque.New[int](0)
  d.PushBack(1)
  d.PushFront(2)

  front, _ := d.PopFront()
  back, _ := d.PopBack()

  fmt.Println(front) // Output: 2
  fmt.Println(back)  // Output: 1
}

A set represents a collection of unique items.

Type Set[T comparable]

  • Constructor:

    func New[T comparable]() *Set[T]
  • Methods:

    • Add(item T): Adds an item to the set.
    • Remove(item T): Removes an item from the set.
    • Has(item T) bool: Returns true if the set contains the specified item.
    • Clear(): Removes all items from the set.
    • Len() int: Returns the number of items in the set.
    • IsEmpty() bool: Returns true if the set is empty.
    • Elements() []T: Returns a slice containing all items in the set.
    • AddAll(items ...T): Adds multiple items to the set.
    • RemoveAll(items ...T): Removes multiple items from the set.
    • Diff(other *Set[T]) *Set[T]: Returns a new set containing items that are in the receiver set but not in the other set.
    • Intersect(other *Set[T]) *Set[T]: Returns a new set containing items that are in both the receiver set and the other set.
    • Union(other *Set[T]) *Set[T]: Returns a new set containing items that are in either the receiver set or the other set.
    • IsSubset(other *Set[T]) bool: Returns true if the receiver set is a subset of the other set.
    • IsSuperset(other *Set[T]) bool: Returns true if the receiver set is a superset of the other set.
    • Equal(other *Set[T]) bool: Returns true if the receiver set is equal to the other set.

A double-ended queue (Deque) allows adding and removing elements from both the front and the back.

Type Deque[T any]

  • Constructor:

    func New[T any](initialCapacity int) *Deque[T]
  • Methods:

    • PushFront(item T): Adds an item to the front of the deque.
    • PushBack(item T): Adds an item to the back of the deque.
    • PopFront() (T, bool): Removes and returns the item at the front of the deque.
    • PopBack() (T, bool): Removes and returns the item at the back of the deque.
    • PeekFront() (T, bool): Returns the item at the front of the deque without removing it.
    • PeekBack() (T, bool): Returns the item at the back of the deque without removing it.
    • Len() int: Returns the number of items in the deque.
    • IsEmpty() bool: Checks if the deque is empty.
    • Clear(): Removes all items from the deque.

A singly linked list where elements can be added or removed from both the front and the end.

Type LinkedList[T any]

  • Constructor:

    func New[T any]() *LinkedList[T]
  • Methods:

    • AddFront(value T): Adds a new node with the given value to the front of the list.
    • AddBack(value T): Adds a new node with the given value to the end of the list.
    • PeekFront() (T, bool): Returns the value of the node at the front without removing it.
    • PeekBack() (T, bool): Returns the value of the node at the end without removing it.
    • RemoveFront() (T, bool): Removes the node from the front and returns its value.
    • RemoveBack() (T, bool): Removes the node from the end and returns its value.
    • Iterate(fn func(T) bool): Iterates over the list and applies a function to each node's value until the function returns false or the end is reached.
    • IsEmpty() bool: Checks if the list is empty.
    • Size() int: Returns the number of elements in the list.
    • Clear(): Removes all elements from the list.

A FIFO (first-in, first-out) queue that supports basic queue operations.

Type Queue[T any]

  • Constructor:

    func New[T any](initialCapacity int) *Queue[T]
  • Methods:

    • Enqueue(item T): Adds an item to the end of the queue.
    • Dequeue() (T, bool): Removes and returns the item at the front of the queue.
    • Peek() (T, bool): Returns the item at the front of the queue without removing it.
    • Len() int: Returns the number of items currently in the queue.
    • IsEmpty() bool: Checks if the queue is empty.
    • Clear(): Removes all items from the queue.

An interface representing a LIFO (last-in, first-out) stack.

Type Stack[T any]

  • Methods:

    • Push(item T): Adds an item to the top of the stack.
    • Pop() (T, bool): Removes and returns the item from the top of the stack.
    • Peek() (T, bool): Returns the item at the top without removing it.
    • Len() int: Returns the number of items in the stack.
    • IsEmpty() bool: Checks if the stack is empty.
    • Clear(): Removes all items from the stack.

An array-based stack implementation using a slice.

Type ArrayStack[T any]

  • Constructor:

    func New[T any](initialCapacity int) *ArrayStack[T]
  • Implements: Stack[T]

  • Methods:

    • Push(item T): Adds an item to the top of the stack.
    • Pop() (T, bool): Removes and returns the item from the top.
    • Peek() (T, bool): Returns the item at the top without removing it.
    • Len() int: Returns the number of items currently in the stack.
    • IsEmpty() bool: Checks if the stack is empty.
    • Clear(): Removes all items, leaving the stack empty.

A linked list-based stack implementation.

Type LinkedListStack[T any]

  • Constructor:

    func New[T any]() *LinkedListStack[T]
  • Implements: Stack[T]

  • Methods:

    • Push(item T): Adds an item to the top of the stack.
    • Pop() (T, bool): Removes and returns the item from the top.
    • Peek() (T, bool): Returns the item at the top without removing it.
    • Len() int: Returns the number of items currently in the stack.
    • IsEmpty() bool: Checks if the stack is empty.
    • Clear(): Removes all items from the stack.

A Trie (prefix tree) data structure that supports insertion and search operations for words and prefixes.

Type Trie

  • Constructor:

    func New() *Trie
  • Methods:

    • Insert(word string): Adds a word to the Trie.
    • Search(word string) bool: Checks if the word exists in the Trie.
    • StartsWith(prefix string) bool: Checks if there is any word in the Trie that starts with the given prefix.

A priority queue allows for efficient retrieval and removal of the highest (or lowest) priority element. It's commonly used in algorithms like Dijkstra's shortest path and task scheduling.

Type PriorityQueue[T any]

  • Constructor:

    func New[T any](less func(a, b T) bool) *PriorityQueue[T]
    • less: A comparison function that determines the priority of elements. If less(a, b) returns true, then a has higher priority than b.
  • Methods:

    • Push(item T): Adds an item to the priority queue.
    • Pop() (T, bool): Removes and returns the highest priority item.
    • Peek() (T, bool): Returns the highest priority item without removing it.
    • Len() int: Returns the number of items in the priority queue.
    • IsEmpty() bool: Checks if the priority queue is empty.
    • Clear(): Removes all items from the priority queue.

A Binary Search Tree (BST) maintains elements in sorted order, allowing for efficient insertion, deletion, and lookup operations. Each node has at most two children, with left child values less than the parent and right child values greater.

Type BST[T Ordered]

  • Constructor:

    func New[T Ordered]() *BST[T]
    • T Ordered: A type constraint that ensures the elements can be compared using < and > operators. Supported types include integers, floats, and strings.
  • Methods:

    • Insert(value T): Inserts a value into the BST.
    • Remove(value T): Removes a value from the BST.
    • Contains(value T) bool: Checks if a value exists in the BST.
    • InOrderTraversal(fn func(T)): Traverses the BST in order and applies a function to each node's value.
    • Len() int: Returns the number of nodes in the BST.
    • IsEmpty() bool: Checks if the BST is empty.
    • Clear(): Removes all nodes from the BST.

A Skip List is a probabilistic data structure that allows fast search, insertion, and deletion operations within an ordered sequence of elements. It achieves efficiency by maintaining multiple levels of linked lists, where each higher level skips over a larger number of elements, allowing operations to be performed in O(log n) average time.

Type SkipList[T Ordered]

  • Constructor:

    func New[T Ordered](maxLevel int, p float64) *SkipList[T]
    • maxLevel: The maximum level of the skip list (controls the space vs. time trade-off).
    • p: The probability factor used to determine the level of new nodes (usually set to 0.5).
  • Methods:

    • Insert(value T): Inserts a value into the skip list.
    • Delete(value T): Deletes a value from the skip list.
    • Search(value T) bool: Searches for a value in the skip list.
    • Len() int: Returns the number of elements in the skip list.
    • IsEmpty() bool: Checks if the skip list is empty.
    • Clear(): Removes all elements from the skip list.

Represents networks of nodes and edges, suitable for various algorithms like search, shortest path, and spanning trees.

Type Graph[T comparable]

  • Constructor:

    func New[T comparable](directed bool) *Graph[T]
    • directed: Specifies whether the graph is directed or undirected.
  • Methods:

    • AddNode(value T): Adds a node to the graph.
    • AddEdge(from, to T, weight float64): Adds an edge between two nodes with an optional weight.
    • RemoveNode(value T): Removes a node and all connected edges.
    • RemoveEdge(from, to T): Removes an edge between two nodes.
    • Neighbors(value T) []T: Returns adjacent nodes.
    • HasNode(value T) bool: Checks if a node exists.
    • HasEdge(from, to T) bool: Checks if an edge exists.
    • GetEdgeWeight(from, to T) (float64, bool): Retrieves the weight of the edge between two nodes.
    • Traverse(start T, visit func(T)): Traverses the graph from a starting node using Breadth-First Search.
    • Nodes() []T: Returns a slice of all node values in the graph.
    • Edges() [][2]T: Returns a slice of all edges in the graph.

A Bloom Filter is a space-efficient probabilistic data structure used to test whether an element is a member of a set. False positive matches are possible, but false negatives are not. Elements can be added to the set, but not removed.

Type BloomFilter[T any]

  • Constructor:

    func NewBloomFilter[T any](expectedItems uint, falsePositiveProb float64) *BloomFilter[T]
    • expectedItems: Expected number of items to be added to the filter
    • falsePositiveProb: Desired false positive probability (between 0 and 1)
  • Methods:

    • Add(item T): Adds an item to the Bloom Filter.
    • Contains(item T) bool: Tests whether an item might be in the set.
    • EstimatedFalsePositiveRate() float64: Returns the current estimated false positive rate.
    • Clear(): Removes all items from the Bloom Filter.
    • Len() int: Returns the number of items added to the Bloom Filter.
    • IsEmpty() bool: Returns true if no items have been added.
    • BitSize() uint: Returns the size of the underlying bit array.
    • NumberOfHashes() uint: Returns the number of hash functions being used.

Example Usage:

// Create a new Bloom Filter expecting 1000 items with 1% false positive rate
bf := collections.NewBloomFilter[string](1000, 0.01)

// Add some items
bf.Add("apple")
bf.Add("banana")
bf.Add("cherry")

// Check for membership
if bf.Contains("apple") {
    fmt.Println("'apple' is probably in the set")
}

// Get current false positive rate
fmt.Printf("False positive rate: %f\n", bf.EstimatedFalsePositiveRate())

// Clear the filter
bf.Clear()

Performance Characteristics:

  • Space Complexity: O(m), where m is the size of the bit array
  • Time Complexity:
    • Add: O(k), where k is the number of hash functions
    • Contains: O(k), where k is the number of hash functions
  • False Positive Probability: (1 - e^(-kn/m))^k
    • k: number of hash functions
    • n: number of inserted elements
    • m: size of bit array

Use Cases:

  • Duplicate detection
  • Cache filtering
  • URL shorteners
  • Spell checkers
  • Network routing
  • Database query optimization

Notes:

  • The Bloom Filter automatically optimizes the number of hash functions and bit array size based on the expected number of items and desired false positive rate.
  • The actual false positive rate may vary slightly from the target rate due to the probabilistic nature of the data structure.
  • The filter supports any type that can be converted to a string representation.

A Ring Buffer (also known as a Circular Buffer) is a fixed-size buffer that wraps around to the beginning when it reaches the end. It's particularly useful for streaming data processing, implementing queues with size limits, and managing buffers in embedded systems.

Type RingBuffer[T any]

  • Constructor:
  func New[T any](capacity int) *RingBuffer[T]
  • Methods:

  • Write(item T) bool: Adds an item to the buffer. Returns false if the buffer is full.

  • Read() (T, bool): Removes and returns the oldest item from the buffer.

  • Peek() (T, bool): Returns the oldest item without removing it.

  • IsFull() bool: Returns true if the buffer is at capacity.

  • IsEmpty() bool: Returns true if the buffer contains no items.

  • Cap() int: Returns the total capacity of the buffer.

  • Len() int: Returns the current number of items in the buffer.

  • Clear(): Removes all items from the buffer.

Example Usage:

package main

import (
    "fmt"
    "github.com/idsulik/go-collections/v2/ringbuffer"
)

func main() {
    // Create a new ring buffer with capacity 3
    rb := ringbuffer.New[string](3)

    // Add some items
    rb.Write("first")
    rb.Write("second")
    rb.Write("third")

    // Buffer is now full
    fmt.Println(rb.IsFull()) // Output: true

    // Read an item
    value, ok := rb.Read()
    if ok {
        fmt.Println(value) // Output: "first"
    }

    // Now we can write another item
    rb.Write("fourth")

    // Read all remaining items
    for !rb.IsEmpty() {
        if value, ok := rb.Read(); ok {
            fmt.Println(value)
        }
    }
}

Performance Characteristics:

  • Space Complexity: O(n), where n is the buffer capacity
  • Time Complexity:
  • Write: O(1)
  • Read: O(1)
  • Peek: O(1)
  • Clear: O(1)

Use Cases:

  • Streaming data processing
  • Fixed-size queues
  • Audio/video buffering
  • Producer-consumer scenarios
  • Network packet buffering
  • Event handling systems
  • Embedded systems with memory constraints

A Segment Tree is a versatile data structure that supports various range query operations (sum, minimum, maximum, GCD, etc.) with efficient updates. It provides O(log n) complexity for both range queries and point updates.

Type SegmentTree[T any]

  • Constructor:

    func NewSegmentTree[T any](arr []T, identity T, combine Operation[T]) *SegmentTree[T]
    • arr: Initial array of elements
    • identity: Identity element for the operation (e.g., 0 for sum, Inf for min)
    • combine: Function that defines how to combine elements (e.g., addition for sum queries)
  • Methods:

    • Update(index int, value T): Updates the value at the given index
    • Query(left, right int) T: Returns the result of the operation for the range [left, right]

Example Usage:

// Create a segment tree for range sum queries
arr := []int{1, 3, 5, 7, 9}
st := collections.NewSegmentTree(arr, 0, func(a, b int) int { return a + b })

// Query range sum
sum := st.Query(1, 3) // Sum of elements from index 1 to 3

// Update value
st.Update(2, 6) // Change value at index 2 to 6

// Create a segment tree for range minimum queries
minSt := collections.NewSegmentTree(arr, math.Inf(1), func(a, b float64) float64 {
    return math.Min(a, b)
})

Performance Characteristics:

  • Construction: O(n)
  • Range Query: O(log n)
  • Point Update: O(log n)
  • Space Complexity: O(n)

Use Cases:

  • Range sum/min/max queries with updates
  • Finding GCD/LCM over ranges
  • Statistical range queries
  • Competitive programming
  • Database query optimization

A Disjoint Set (also known as Union-Find) is a data structure that keeps track of elements partitioned into non-overlapping subsets. It provides near-constant-time operations to merge sets and determine if two elements belong to the same set.

Type DisjointSet[T comparable]

  • Constructor:

    func New[T comparable]() *DisjointSet[T]
  • Methods:

    • MakeSet(x T): Creates a new set containing a single element.
    • Find(x T) T: Returns the representative element of the set containing x.
    • Union(x, y T): Merges the sets containing elements x and y.
    • Connected(x, y T) bool: Returns true if elements x and y are in the same set.
    • Clear(): Removes all elements from the disjoint set.
    • Len() int: Returns the number of elements in the disjoint set.
    • IsEmpty() bool: Returns true if the disjoint set contains no elements.
    • GetSets() map[T][]T: Returns a map of representatives to their set members.

Example Usage:

package main

import (
    "fmt"
    "github.com/idsulik/go-collections/v2/disjointset"
)

func main() {
    // Create a new disjoint set
    ds := disjointset.New[string]()

    // Create individual sets
    ds.MakeSet("A")
    ds.MakeSet("B")
    ds.MakeSet("C")
    ds.MakeSet("D")

    // Merge sets
    ds.Union("A", "B")
    ds.Union("C", "D")

    // Check if elements are in the same set
    fmt.Println(ds.Connected("A", "B")) // true
    fmt.Println(ds.Connected("A", "C")) // false

    // Get all sets
    sets := ds.GetSets()
    for root, elements := range sets {
        fmt.Printf("Set with root %v: %v\n", root, elements)
    }
}

Performance Characteristics:

  • MakeSet: O(1)
  • Find: O(α(n)) amortized (nearly constant)
  • Union: O(α(n)) amortized (nearly constant)
  • Connected: O(α(n)) amortized (nearly constant)

Where α(n) is the inverse Ackermann function, which grows extremely slowly and is effectively constant for all practical values of n.

Use Cases:

  • Detecting cycles in graphs
  • Finding connected components
  • Network connectivity
  • Image processing (connected component labeling)
  • Kruskal's minimum spanning tree algorithm
  • Dynamic connectivity problems
  • Online dynamic connectivity
  • Percolation analysis

An AVL tree is a self-balancing binary search tree where the heights of the two child subtrees of any node differ by at most one. This balancing ensures O(log n) time complexity for insertions, deletions, and lookups.

Type AVLTree[T any]

  • Constructor:

    func New[T any](compare func(a, b T) int) *AVLTree[T]
    • compare: A comparison function that returns:
      • -1 if a < b
      • 0 if a == b
      • 1 if a > b
  • Methods:

    • Insert(value T): Adds a value to the tree while maintaining AVL balance
    • Delete(value T) bool: Removes a value from the tree while maintaining AVL balance
    • Search(value T) bool: Checks if a value exists in the tree
    • InOrderTraversal(fn func(T)): Visits all nodes in ascending order
    • Clear(): Removes all elements from the tree
    • Size() int: Returns the number of nodes in the tree
    • IsEmpty() bool: Checks if the tree is empty
    • Height() int: Returns the height of the tree

Example Usage:

package main

import (
    "fmt"
    "github.com/idsulik/go-collections/v2/avltree"
)

// Compare function for integers
func compareInts(a, b int) int {
    if a < b {
        return -1
    }
    if a > b {
        return 1
    }
    return 0
}

func main() {
    // Create a new AVL tree
    tree := avl.New[int](compareInts)

    // Insert values
    values := []int{50, 30, 70, 20, 40, 60, 80}
    for _, v := range values {
        tree.Insert(v)
    }

    // Search for values
    fmt.Println(tree.Search(30)) // true
    fmt.Println(tree.Search(45)) // false

    // Traverse the tree in order
    tree.InOrderTraversal(func(value int) {
        fmt.Printf("%d ", value)
    }) // Output: 20 30 40 50 60 70 80

    // Delete a value
    tree.Delete(30)

    // Check size and height
    fmt.Printf("\nSize: %d, Height: %d\n", tree.Size(), tree.Height())
}

Performance Characteristics:

Operation Average Case Worst Case
Space O(n) O(n)
Search O(log n) O(log n)
Insert O(log n) O(log n)
Delete O(log n) O(log n)

Where n is the number of nodes in the tree.

Balance Property:

The AVL tree maintains the following invariants:

  1. The heights of the left and right subtrees of any node differ by at most 1
  2. Balance factor = height(left subtree) - height(right subtree)
  3. Balance factor must be in {-1, 0, 1} for all nodes

Use Cases:

  1. Database Indexing:

    • Maintaining sorted indices for fast lookups
    • Range queries on indexed columns
  2. Memory Management:

    • Managing memory allocation segments
    • Virtual memory page tables
  3. File System:

    • Directory structures
    • File allocation tables
  4. Network Routing:

    • IP routing tables
    • Network flow optimization
  5. Game Development:

    • Spatial partitioning
    • Scene graph management

A Red-Black Tree is a self-balancing binary search tree where each node is colored either red or black, following specific rules that maintain balance. This ensures O(log n) time complexity for insertions, deletions, and lookups.

Type RedBlackTree[T any]

  • Constructor:
func New[T any](compare func(a, b T) int) *RedBlackTree[T]
  • compare: A comparison function that returns:

    • -1 if a < b
    • 0 if a == b
    • 1 if a > b
  • Methods:

    • Insert(value T): Adds a value to the tree while maintaining Red-Black properties
    • Delete(value T) bool: Removes a value from the tree while maintaining Red-Black properties
    • Search(value T) bool: Checks if a value exists in the tree
    • InOrderTraversal(fn func(T)): Visits all nodes in ascending order
    • Clear(): Removes all elements from the tree
    • Size() int: Returns the number of nodes in the tree
    • IsEmpty() bool: Checks if the tree is empty
    • Height() int: Returns the height of the tree

Example Usage:

package main

import (
    "fmt"
    "github.com/idsulik/go-collections/v2/rbtree"
)

// Compare function for integers
func compareInts(a, b int) int {
    if a < b {
        return -1
    }
    if a > b {
        return 1
    }
    return 0
}

func main() {
    // Create a new Red-Black tree
    tree := rbtree.New[int](compareInts)

    // Insert values
    values := []int{50, 30, 70, 20, 40, 60, 80}
    for _, v := range values {
        tree.Insert(v)
    }

    // Search for values
    fmt.Println(tree.Search(30))  // true
    fmt.Println(tree.Search(45))  // false

    // Traverse the tree in order
    tree.InOrderTraversal(func(value int) {
        fmt.Printf("%d ", value)
    }) // Output: 20 30 40 50 60 70 80

    // Delete a value
    tree.Delete(30)

    // Check size and height
    fmt.Printf("\nSize: %d, Height: %d\n", tree.Size(), tree.Height())
}

Performance Characteristics:

Operation Average Case Worst Case
Space O(n) O(n)
Search O(log n) O(log n)
Insert O(log n) O(log n)
Delete O(log n) O(log n)

Where n is the number of nodes in the tree.

Balance Property:

The Red-Black tree maintains the following invariants:

  1. Every node is either red or black
  2. The root is always black
  3. All leaves (NIL nodes) are black
  4. If a node is red, both its children are black
  5. Every path from root to leaf contains the same number of black nodes

Use Cases:

  1. Database Systems:
  • Index structures implementation
  • Multi-version concurrency control
  1. Memory Management:
  • Virtual memory segment trees
  • Memory allocation tracking
  1. File Systems:
  • Directory organization
  • File system journaling
  1. Process Scheduling:
  • Task prioritization
  • Real-time scheduling
  1. Programming Languages:
  • Symbol table implementation
  • Garbage collection algorithms

Performance Comparison

Data Structure Access Search Insertion Deletion Space
Array O(1) O(n) O(n) O(n) O(n)
Set O(1) O(1) O(1) O(1) O(n)
Queue O(1) O(n) O(1)* O(1)* O(n)
Priority Queue O(1) O(1) O(log n) O(log n) O(n)
BST (balanced) O(log n) O(log n) O(log n) O(log n) O(n)
Trie O(m) O(m) O(m) O(m) O(ALPHABET_SIZE * m * n)
Graph O(1) O(V+E) O(E) O(E) O(V+E)
BloomFilter N/A O(k) O(k) N/A O(m)
Disjoint Set O(α(n)) O(α(n)) O(α(n)) O(α(n)) O(n)
RingBuffer O(1) O(n) O(1) O(1) O(n)
SkipList O(1) O(log n) O(log n) O(log n) O(n log n)
LinkedList O(n) O(n) O(1) O(1)** O(n)
SegmentTree O(1) O(log n) O(log n) O(log n) O(n)
ArrayStack O(1) O(n) O(1)* O(1)* O(n)
LinkedListStack O(1) O(n) O(1) O(1) O(n)
Deque O(1) O(n) O(1)* O(1)* O(n)

Where:

  • n is the number of elements
  • m is the length of the string/key
  • k is the number of hash functions
  • V is the number of vertices
  • E is the number of edges
  • α(n) is the inverse Ackermann function (effectively constant)
    • indicates amortized time complexity
  • ** for LinkedList, deletion is O(1) at front/back but O(n) for arbitrary position

Contributing

Contributions are welcome! Please read our Contributing Guide for details on our code of conduct and the process for submitting pull requests.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

  • Thanks to all contributors who have helped shape this library
  • Inspired by various Go community projects and standard library patterns