NDDB


NDDB is a powerful and versatile object database for node.js and the browser.


NDDB (N-Dimensional DataBase) supports indexes, views, hashes, joins, group-by, basic statistics, custom operations, saving and loading from file system and browser localStorage, and much more.

Developer-friendly thanks to an easy API, detailed documentation, and wide test coverage.

List of features

  • Selecting: select, and, or
  • Sorting: sort, reverse, last, first, limit, distinct, shuffle
  • Indexing: view, index, hash, comparator
  • Custom callbacks: map, each, filter
  • Updating and Deletion: update, remove, clear
  • Advanced operations: split, join, concat, groupBy
  • Fetching and transformations: fetch, fetchArray, fetchKeyArray, fetchValues, fetchSubObj
  • Statistics operator: count, max, min, mean, stddev
  • Diff: diff, intersect
  • Skim: skim, keep
  • Iterator: previous, next, first, last
  • Tagging: tag
  • Event listener / emitter: on, off, emit
  • Saving and Loading: save, saveSync, load, loadSync, setWD, getWD, loadDir, loadDirSync

Usage

Load the library in Node.js:

const NDDB = require('NDDB');

// Backward-compatible mode.
// const NDDB = require('NDDB').NDDB;

or in the browser add a script tag in the page:

<!-- Must load a version of NDDB that includes JSUS (see 'build/' dir) -->
<script src="/path/to/nddb.js"></script>

Create an instance of NDDB:

let db = NDDB.db();
// let db = new NDDB(); // legacy

Insert an item into the database:

// Add one item to the database.
db.insert({
    painter: "Picasso",
    title: "Les Demoiselles d'Avignon",
    year: 1907
});

Import a collection of items:

let items = [
    {
        painter: "Dali",
        title: "Portrait of Paul Eluard",
        year: 1929,
        portrait: true
    },
    {
        painter: "Dali",
        title: "Barcelonese Mannequin",
        year: 1927
    },
    {
        painter: "Monet",
        title: "Water Lilies",
        year: 1906
    },
    {
        painter: "Monet",
        title: "Wheatstacks (End of Summer)",
        year: 1891
    },
    {
        painter: "Manet",
        title: "Olympia",
        year: 1863
    }
];

// Import an array of items at once.
db.importDB(items);

Retrieve the database size:

db.size(); // 6

Select Items

Select statements begin with select and can be refined with and and or statements. Select statements accept three input parameters:

  • 'property'
  • 'operator'
  • any additional number of arguments required by operator

Basic operators include standard logical operators:

  • =, ==, !=, >, >=, <, <=,

Advanced comparison operators include:

  • E: field exists (can be omitted, it is the default one)
  • ><: between values (expects an array as third parameter)
  • <>: not between values (expects an array as third parameter)
  • in: element is found in array (expects an array as third parameter)
  • !in: element is not found in array (expects an array as third parameter)
  • LIKE: string SQL LIKE (case sensitive)
  • iLIKE: string SQL LIKE (case insensitive)

It is possible to access and compare nested properties by separating them with a dot (.).
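
As a rough illustration of what dot notation means, the sketch below resolves a dot-separated path in plain JavaScript. The helper getNested and the sample object are hypothetical and not part of NDDB; the library's internal lookup may differ.

```javascript
// Hypothetical helper (not part of NDDB): resolves a dot-separated
// path such as 'author.name' against a plain object.
function getNested(obj, path) {
    return path.split('.').reduce(function(current, key) {
        // Stop descending as soon as an intermediate value is missing.
        return (current === undefined || current === null) ?
            undefined : current[key];
    }, obj);
}

const painting = {
    title: 'Olympia',
    author: { name: 'Manet', born: 1832 }
};

getNested(painting, 'author.name');   // 'Manet'
getNested(painting, 'author.died');   // undefined
getNested(painting, 'museum.city');   // undefined
```

In a select statement, the same kind of path would be passed as the property argument, for example 'author.name'.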

Select Examples

Select all paintings from Dali:

db.select('painter', '=', 'Dali'); // 2 items

Case sensitive LIKE operator:

db.select('painter', 'LIKE', 'M_net'); // 3 items

Select on multiple properties (*) with case insensitive LIKE:

db.select('*', 'iLIKE', '%e%'); // All items
db.select(['painter', 'portrait'], 'iLIKE', '%e%') // 5 items
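
For readers unfamiliar with SQL LIKE patterns, the following sketch shows the usual meaning of the wildcards: % matches any sequence of characters and _ matches exactly one. The helper likeToRegExp is hypothetical and only approximates the semantics in plain JavaScript; it is not how NDDB necessarily implements the operator.

```javascript
// Hypothetical helper: converts an SQL LIKE pattern into a RegExp.
//   %  matches any sequence of characters (including none)
//   _  matches exactly one character
function likeToRegExp(pattern, caseInsensitive) {
    const source = pattern
        // Escape characters that are special in regular expressions.
        .replace(/[.*+?^${}()|[\]\\]/g, '\\$&')
        .replace(/%/g, '.*')
        .replace(/_/g, '.');
    return new RegExp('^' + source + '$', caseInsensitive ? 'i' : '');
}

const painterNames = ['Picasso', 'Dali', 'Dali', 'Monet', 'Monet', 'Manet'];

// Case-sensitive match, like the LIKE example above.
const likeMatches = painterNames.filter(function(name) {
    return likeToRegExp('M_net').test(name);
});
// likeMatches: [ 'Monet', 'Monet', 'Manet' ]
```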

Select all portraits:

// Property 'portrait' must not be undefined.
db.select('portrait'); // 1 item

Select all paintings from Dali that are before 1928:

db.select('painter', '=', 'Dali')
  .and('year', '<', 1928); // 1 item

Select all paintings from the beginning of the 20th century:

db.select('year', '><', [1900, 1910]) // 2 items

Fetching items

Select statements are not evaluated until a fetch statement is invoked, returning the array of selected items, and preventing further chaining.

db.select('painter', '=', 'Dali').fetch();

// [
// {
//     painter: "Dali",
//     title: "Portrait of Paul Eluard",
//     year: 1929,
//     portrait: true
// },
// {
//     painter: "Dali",
//     title: "Barcelonese Mannequin",
//     year: 1927
// }
// ]

Other fetch methods can manipulate the items before they are returned.

// Create a new database without the items by Picasso.
let newDb = db.select('painter', '!=', 'Picasso').breed();

// fetchValues
//
// Fetch all the values of specified properties and return them in an object.
newDb.fetchValues(['painter', 'year']);

// {
//   painter: [ 'Dali', 'Dali', 'Monet', 'Monet', 'Manet' ],
//   year: [ 1929, 1927, 1906, 1891, 1863 ]
// }

// fetchSubObj
//
// Keeps only specified properties in the objects, before returning them in
// an array (items in the original database are NOT modified).
newDb.fetchSubObj(['painter', 'year']);

// [
//     {
//         painter: "Dali",
//         year: 1929
//     },
//     {
//         painter: "Dali",
//         year: 1927
//     },
//     {
//         painter: "Monet",
//         year: 1906
//     },
//     {
//         painter: "Monet",
//         year: 1891
//     },
//     {
//         painter: "Manet",
//         year: 1863
//     }
// ]    

// fetchArray
//
// Returns the items as arrays.
newDb.fetchArray()
// [
//  [ 'Dali', 'Portrait of Paul Eluard', 1929, true ],
//  [ 'Dali', 'Barcelonese Mannequin', 1927 ],
//  [ 'Monet', 'Water Lilies', 1906 ],
//  [ 'Monet', 'Wheatstacks (End of Summer)', 1891 ],
//  [ 'Manet', 'Olympia', 1863 ]
// ]


// fetchKeyArray
//
// Returns the items as arrays (including the keys).
newDb.fetchKeyArray()
// [
//   [
//     'painter', 'Dali', 'title', 'Portrait of Paul Eluard', 'year',
//     1929, 'portrait', true
//   ],
//   [ 'painter', 'Dali', 'title', 'Barcelonese Mannequin', 'year', 1927 ],
//   [ 'painter', 'Monet', 'title', 'Water Lilies', 'year', 1906 ],
//   [
//     'painter', 'Monet', 'title', 'Wheatstacks (End of Summer)', 'year', 1891
//   ],
//   [ 'painter', 'Manet', 'title', 'Olympia', 'year', 1863 ]
// ]

Sorting

Define a global comparator function that sorts all the entries chronologically:

db.globalCompare = function (o1, o2) {
    if (o1.year < o2.year) return -1;
    if (o1.year > o2.year) return 1;
    return 0;
};

Sort all the items (global comparator function is automatically used):

db.sort(); // Order: Manet, Monet, Monet, Picasso, Dali, Dali

Reverse the order of the items:

db.reverse(); // Order: Dali, Dali, Picasso, Monet, Monet, Manet

Define a custom comparator function for the name of the painter, which gives highest priorities to the canvases of Picasso:

db.compare('painter', function (o1, o2) {
    if (o1.painter === 'Picasso') return -1;
    if (o2.painter === 'Picasso') return 1;
    // Neither item is by Picasso: treat them as equal.
    return 0;
});

Sort all the paintings by painter using the new comparator:

db.sort('painter'); // Picasso is always listed first.
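
Comparators follow the same contract as the callback of the standard Array.prototype.sort: a negative value places the first item before the second, a positive value places it after, and zero means no preference. The sketch below applies a "Picasso first" comparator to a plain array; the names picassoFirst and sample are illustrative only.

```javascript
// Comparator that moves Picasso's paintings to the front and leaves
// the relative order of all other paintings untouched.
function picassoFirst(o1, o2) {
    const p1 = o1.painter === 'Picasso';
    const p2 = o2.painter === 'Picasso';
    if (p1 && !p2) return -1;
    if (!p1 && p2) return 1;
    return 0;
}

const sample = [
    { painter: 'Dali', year: 1929 },
    { painter: 'Picasso', year: 1907 },
    { painter: 'Monet', year: 1906 }
];

// slice() copies the array, so the original order is preserved.
const sortedSample = sample.slice().sort(picassoFirst);
// Order of painters in sortedSample: Picasso, Dali, Monet
```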

Views

Views split the database into sub-databases, each containing a semantically consistent set of entries:

// Let us add some cars to our previous database of paintings.
let cars = [
    {
      car: "Ferrari",
      model: "F10",
      speed: 350,
    },
    {
      car: "Fiat",
      model: "500",
      speed: 100,
    },
    {
      car: "BMW",
      model: "Z4",
      speed: 250,
    },
];

// Import the cars into the database.
db.importDB(cars);

// Default view: returns items with the value
// of the property 'painter' !== undefined.
db.view('painter');

// Make the view function explicit.
db.view('art', function(o) {
  return o.painter;
});

db.view('cars', function(o) {
  return o.car;
});

db.rebuildIndexes();

db.size();          // 9
db.painter.size();  // NDDB with 6 art entries
db.art.size();      // NDDB with 6 art entries
db.cars.size();     // NDDB with 3 car entries

Hashing

Define a custom hash function that creates a new view for each of the painters in the database:

db.hash('painter');
// Or the equivalent explicit function definition.
db.hash('painter', function(o) {
    return o.painter;
});

db.rebuildIndexes();

db.size();          // 6, unchanged;
db.painter.Picasso; // NDDB with 1 element in db
db.painter.Monet    // NDDB with 2 elements in db
db.painter.Manet    // NDDB with 1 element in db
db.painter.Dali     // NDDB with 2 elements in db
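
Conceptually, a hash partitions the items by the value returned by the hash function. The sketch below reproduces that idea with plain arrays instead of NDDB instances. The helper hashBy is hypothetical, and skipping items whose key is undefined is an assumption made for this example.

```javascript
// Hypothetical helper: groups items by the key returned by hashFn.
function hashBy(items, hashFn) {
    const groups = {};
    for (const item of items) {
        const key = hashFn(item);
        // Assumption: items without a key are left out of every group.
        if (key === undefined) continue;
        if (!groups[key]) groups[key] = [];
        groups[key].push(item);
    }
    return groups;
}

const artworks = [
    { painter: 'Picasso', year: 1907 },
    { painter: 'Dali', year: 1929 },
    { painter: 'Dali', year: 1927 },
    { painter: 'Monet', year: 1906 },
    { painter: 'Monet', year: 1891 },
    { painter: 'Manet', year: 1863 },
    { car: 'Fiat', model: '500' }      // No painter: not hashed.
];

const byPainter = hashBy(artworks, function(o) { return o.painter; });
// byPainter.Picasso.length: 1
// byPainter.Dali.length:    2
// byPainter.Monet.length:   2
// byPainter.Manet.length:   1
```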

Listening to events

NDDB fires the following events: insert, update, remove, setwd, save, load. Users can listen to these events and modify their behavior.

Decorating objects on insert

Listen to the insert event and modify the inserted items by adding an index that is incremented sequentially:

let id = 0;
function getMyId(){ return id++; };

db.on('insert', function(item) {
    item.myId = getMyId();
});

Canceling operations: insert, update, remove.

Event listeners can block the execution of the operation by returning false. No errors are thrown.

// Insert event.
// Parameters:
//  - item: the item to insert.
db.on('insert', function(item) {
    if (item.year > 3000) return false; // Item is not added.
});

// Update event.
// Parameters:
//   - item: the item to update.
//   - update: an object containing the properties to update/add.
//   - idx: the index of the item in the reference database (note: in a
//          sub-selection, the index of the item may differ from its index
//          in the main database.)
db.on('update', function(item, update, idx) {
    if (update.year > 3000) return false; // Item is not updated.
});

// Remove event.
// Parameters:
//   - item: the item to remove.
//   - idx: the index of the item in the reference database (note: in a
//          sub-selection, the index of the item may differ from its index
//          in the main database.)
db.on('remove', function(item, idx) {
    if (item.year < 3000) return false; // Item is not removed.
});

Attention! The order in which the event listeners are added matters. If an event listener returns false, all successive event listeners are skipped.
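
The skipping rule can be pictured with a tiny stand-alone emitter. This is a hypothetical sketch written for this explanation, not NDDB's actual implementation: listeners run in registration order, and the first one that returns false stops the chain.

```javascript
// Hypothetical mini-emitter, only to illustrate the skipping rule.
function createEmitter() {
    const listeners = [];
    return {
        on: function(cb) { listeners.push(cb); },
        // Returns false if a listener vetoed the operation.
        emit: function(item) {
            for (const cb of listeners) {
                if (cb(item) === false) return false;
            }
            return true;
        }
    };
}

const calls = [];
const emitter = createEmitter();

// First listener: validates the item.
emitter.on(function(item) {
    calls.push('validate');
    if (item.year > 3000) return false;
});

// Second listener: decorates the item (skipped after a veto).
emitter.on(function(item) {
    calls.push('decorate');
    item.checked = true;
});

const futureItem = { year: 3500 };
const accepted = emitter.emit(futureItem);
// accepted: false; futureItem.checked is undefined; calls: [ 'validate' ]
```

If the two listeners were registered in the opposite order, the item would be decorated before being rejected.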

Modifying save/load options.

// Save/load event (both sync or async).
// Parameters:
//   - options: object with the user options for the save/load event.
//   - info: an object containing information about the save/load command,
//           which cannot be altered. Format:
//           {
//               file:     'path/to/file.csv',
//               format:   'csv',
//               cb:       function() {},   // User defined function, if any.
//           }
//
db.on('save', function(options, info) {
    if (info.format === 'csv') {
        options.header = [ 'id', 'time', 'action'];  // Modify header.
    }
});

Intercept changes in working directory

// Set working directory event.
// Parameters:
//   - wd: The new working directory.
db.on('setwd', function(wd) {
    // Take note of the change, the value cannot be modified.
});

Indexes

Define a custom indexing function that gives fast, direct access to the items of the database:

db.index('id');
// Or the equivalent explicit function definition.
db.index('id', function(o) {
    return o.id;
});

db.rebuildIndexes();

db.id.get(0).painter; // Picasso

db.id.update(0, {
  comment: "Good job Pablo!"
});

// Counts items in selection.
db.select('comment').count(); // 1

let picasso = db.id.remove(0);
db.size(); // (0)

// Get all available keys in the index.
db.id.getAllKeys(); // ['0','1', ... ]

// Get all elements indexed by their key in one object.
db.id.getAllKeyElements();

Default index

The property ._nddbid is added to every inserted item. The property is not enumerable (if the environment permits it), and all items are indexed against it:

db.nddbid.get('123456'); // Returns the item with nddbid equal to 123456.

Configuration Options

let logFunc = function(txt, level) {
  if (level > 0) {
    console.log(txt);
  }
};

let options = {
  tags:  {},          // Collection of tags
  update: {           // On every insert, remove and update:
    indexes:  true,   // Updates the indexes, if any
    sort:     true,   // Sorts the items of the database
    pointer:  true,   // Moves the iterator to the last inserted element
  },
  C:  {},             // Collection of comparator functions
  H:  {},             // Collection of hashing functions
  I:  {},             // Collection of indexing functions
  V:  {},             // Collection of view functions
  log: logFunc,       // Default stdout
  logCtx: logCtx,     // The context of execution for the log function
  nddb_pointer: 4,    // Set the pointer to element of index 4
  globalCompare: function(o1, o2) {
    // Comparator.
  },
  filters: {          // Extends NDDB with new operators for select queries
    '%': function(d, value, comparator) {
          return function(elem) {
            if ((elem[d] % value) === 0) {
              return elem;
            }
          }
      }
  },
  share: {           // Contains objects that are copied by reference
                     // into every new instance of NDDB.
    sharedObj: sharedObj
  }
}

let nddb = NDDB.db(options);

// or

nddb = NDDB.db();
nddb.init(options);

Saving and Loading Items

The items in the database can be saved and loaded using the save and load methods, or their synchronous implementations saveSync and loadSync.

The methods loadDir and loadDirSync load an entire directory.

The following formats are available: csv, json, and ndjson.
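
For readers new to ndjson (newline-delimited JSON): each line of the file is one complete JSON document. The sketch below shows the idea with two hypothetical helpers, toNdjson and fromNdjson; the exact layout that NDDB writes may differ.

```javascript
// Hypothetical helpers illustrating newline-delimited JSON.

// One JSON document per line, each line terminated by '\n'.
function toNdjson(items) {
    return items.map(function(item) {
        return JSON.stringify(item);
    }).join('\n') + '\n';
}

// Parses every non-empty line as an independent JSON document.
function fromNdjson(text) {
    return text.split('\n')
        .filter(function(line) { return line.trim() !== ''; })
        .map(function(line) { return JSON.parse(line); });
}

const ndjsonText = toNdjson([
    { painter: 'Monet', title: 'Water Lilies', year: 1906 },
    { painter: 'Manet', title: 'Olympia', year: 1863 }
]);
// {"painter":"Monet","title":"Water Lilies","year":1906}
// {"painter":"Manet","title":"Olympia","year":1863}

const restored = fromNdjson(ndjsonText);
```

Because every line is self-contained, new items can be appended to the file without rewriting what is already there.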

Saving and loading to file system (node.js environment)

Two formats are natively supported: .json and .csv (automatically detected by the filename's extension). For unknown extensions, NDDB falls back to the default format (json, but it can be overridden).

It is possible to specify new formats using the addFormat method.

Save/Load Examples

// SAVING.

// Saving items in JSON format.
db.save('db.json', () => console.log("Saved db into 'db.json'") );

// Saving items in CSV format.
db.save('db.csv', () => console.log("Saved db into 'db.csv'") );

// Saving items in NDJSON format.
db.save('db.ndjson', () => console.log("Saved db into 'db.ndjson'") );

// Saving items synchronously in CSV format.
db.saveSync('db.csv');
console.log("Saved db into 'db.csv'");

// Saving items in the default format (usually json).
db.getDefaultFormat(); // json
db.save('db.out', function() {
    console.log("Saved db into 'db.out'");
});

// Specifying the default format and saving into CSV.
db.setDefaultFormat('csv');
db.save('db.out', function() {
    console.log("Saved db into 'db.out'");
});

// LOADING.

// Loading items into database synchronously.
db.loadSync('db.csv');
console.log("Loaded csv file into database");

// Loading items into database asynchronously.
db.load('db.csv', () => console.log("Loaded csv file into database") );

Loading an entire directory

The methods loadDir and loadDirSync load an entire directory.

loadDir Options

In addition to the options of the native load method of the chosen format:

  • recursive: if TRUE, it will look into sub-directories. Default: FALSE.
  • maxRecLevel: the max level of recursion allowed. Default: 10.
  • filter: A filter function or a regular expression to apply to every file name.
  • dirFilter: A filter function or a regular expression to apply to every directory name.
  • onError: What to do in case of errors: 'continue' will skip the file with errors and go to the next one.

// Load files.
let opts = {
  recursive: true,
  filter: 'bonus',      // All files containing the word 'bonus'.
  dirFilter: (dir) => {
    return !~dir.indexOf("skip"); // Skip if directory contains word 'skip'.
  },

  // Alternative filters:

  // filter: file => file === 'bonus.csv', // Only 'bonus.csv' files.
  // format: 'csv'     // All 'csv' files.
};

db.loadDirSync(DATADIR, opts);

Note: loadDir is not yet fully async. It loads files into the database asynchronously, but scans for files in the file system synchronously.

Adding a New Format

// Specify a new format.
db.addFormat('asd', {
   save: function(db, file, cb, options) {
         // save file asynchronously.
   },
   load: function(db, file, cb, options) {
         // load file asynchronously.
   },
   saveSync: function(db, file, cb, options) {
         // save file synchronously.
   },
   loadSync: function(db, file, cb, options) {
         // load file synchronously.
   }
});

// Saving in the new format.
db.save('db.asd');
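
To make the skeleton above more concrete, here is a minimal sketch of a format that stores all items as pretty-printed JSON. The format name 'pretty' and the object prettyFormat are hypothetical. The sketch only relies on fetch and importDB, and it assumes that the format functions are responsible for invoking the user callback; check the library documentation before relying on that detail.

```javascript
const fs = require('fs');

// Hypothetical format: all items as one pretty-printed JSON array.
const prettyFormat = {
    saveSync: function(db, file, cb, options) {
        fs.writeFileSync(file, JSON.stringify(db.fetch(), null, 2));
    },
    loadSync: function(db, file, cb, options) {
        db.importDB(JSON.parse(fs.readFileSync(file, 'utf-8')));
    },
    save: function(db, file, cb, options) {
        const text = JSON.stringify(db.fetch(), null, 2);
        fs.writeFile(file, text, function(err) {
            if (err) throw err;
            // Assumption: the format itself calls the user callback.
            if (cb) cb();
        });
    },
    load: function(db, file, cb, options) {
        fs.readFile(file, 'utf-8', function(err, text) {
            if (err) throw err;
            db.importDB(JSON.parse(text));
            if (cb) cb();
        });
    }
};

// Registration would follow the same pattern as above:
// db.addFormat('pretty', prettyFormat);
// db.saveSync('db.pretty');
```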

Streaming items to file system (node.js environment)

The stream method automatically saves items inserted into the database to the file system.

db.stream();
// Save items to [db name].[default format], for example: 'nddb.json'.

Stream options:

The stream method takes an optional configuration object:

  • format: the format: csv, json, ndjson.
  • filename: path to file name (default [db name].[format])
  • delay: milliseconds to wait before copying items to file system (default 10)
  • journal: if TRUE, items are encapsulated in a data structure that contains information about the operation (insert, update, delete).

Journaling operations to file system (node.js environment)

The journal method keeps track of all operations (not just inserts).

db.journal();
// Save items to [db name].journal, for example: 'nddb.journal'.

This method is a wrapper for the stream method with the journal flag set to TRUE.

Items are saved in ndjson format and can then be imported into a new database with the importJournal method.

db.importJournal();
// All operations (inserts, updates, deletes) replayed.

CSV Advanced Options

Specifying an Adapter

// Transform items before saving them to CSV format.

let options = {
    adapter: {

        // Double all numbers in column "A".
        A: function(item) { return item.A * 2; },

        // Rename a property (must add shortName to a custom header).
        shortName: 'muchLongerName'
    }
};

db.save('db2.csv', options, () => {
    console.log("Saved db as csv into 'db2.csv'");
    console.log("Numbers in column 'A' were doubled");
    console.log("Values in column 'shortName' are taken from column 'muchLongerName'");
});


// Transform items before loading them into database.
// Loading items into database.
options = {
    adapter: {
        A: function(item) { return item.A / 2; }
    }
};

db.load('db2.csv', options, () => {
   console.log("Loaded csv file into database");
   console.log("Numbers in column 'A' were halved");
});

Saving Updates Only

If you already know when new items are added to the database, you can save incremental updates using the updatesOnly flag.

// Feedback view already created.
db.comment.save('comments.csv', {

    // Custom header.
    header: [ 'timestamp', 'user', 'feedback' ],

    // Saves only updates from previous save command.
    updatesOnly: true
});

Flatten Items

If a single user enters multiple items in the database, but you need only one row in the CSV file, you can use the flatten flag.

db.view('user').save('users.csv', {

    // Custom header.
    header: [
        "user", "comment", "date", "name", "last", "rating"
    ],

    // Merges all items together.
    flatten: true,
});

If you have multiple users in the database, the option flattenByGroup will create one CSV row per group (e.g., user).

db.user.save('users.csv', {

    // Custom header.
    header: [
        "user", "comment", "date", "name", "last", "rating"
    ],

    // Merges all items together.
    flatten: true,

    // One row per user (can also be a function returning the id of the group).
    flattenByGroup: 'user',
});
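
The effect of flattenByGroup can be pictured as merging all items that share the same group value into a single object, which then becomes one CSV row. The sketch below is a plain JavaScript approximation. The helper mergeByGroup is hypothetical, and the rule that later items overwrite earlier ones on conflicting keys is an assumption of this example, not a documented guarantee.

```javascript
// Hypothetical helper: merges items sharing the same value of groupKey.
function mergeByGroup(items, groupKey) {
    const rows = new Map();
    for (const item of items) {
        const id = item[groupKey];
        const row = rows.get(id) || {};
        // Assumption: later items overwrite earlier ones on conflicts.
        rows.set(id, Object.assign(row, item));
    }
    // A Map keeps insertion order: one row per group, first seen first.
    return Array.from(rows.values());
}

const entries = [
    { user: 'ann', comment: 'Nice' },
    { user: 'bob', rating: 3 },
    { user: 'ann', rating: 5 }
];

const mergedRows = mergeByGroup(entries, 'user');
// [
//   { user: 'ann', comment: 'Nice', rating: 5 },
//   { user: 'bob', rating: 3 }
// ]
```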

In case you need to periodically flatten the items, use the flatten option in combination with the updatesOnly flag.

Setting the current working directory (node.js environment)

It is possible to specify the current working directory to avoid typing long file paths.

// Loading items in JSON format.
db.load('/home/this/user/on/that/dir/db.json');
db.load('/home/this/user/on/that/dir/db2.json');
db.load('/home/this/user/on/that/dir/db3.json');

// Can be shortened to:
db.setWD('/home/this/user/on/that/dir');
db.load('db.json');
db.load('db2.json');
db.load('db3.json');

// Get current working directory:
db.getWD(); // /home/this/user/on/that/dir/

List of all available options

{

    flags: 'w',                     // The Node.js flag to write to fs.
                                    // Default: 'a' (append).

    encoding: 'utf-8',              // The encoding of the file.

    mode: 0o777,                    // The permission given to the file.
                                    // Default: 0o666

    // Options below are CSV ONLY:

    header: true,                   // Loading:
                                    //  - true: use first line of
                                    //      file as key names (default)
                                    //  - false: use [ 'X1'...'XN' ]
                                    //      as key names;
                                    //  - array of strings: used as
                                    //      is as key names;
                                    //  - array of booleans: selects
                                    //      key names in order from
                                    //      columns in csv file
                                    //
                                    // Saving:
                                    //  - true: use keys of first
                                    //      item as column names (default)
                                    //  - 'all': collect all keys
                                    //      from all elements and use
                                    //      as column names
                                    //  - function: a callback that
                                    //      takes each unique key in
                                    //      the db and returns:
                                    //      another substitute string,
                                    //      an array of strings to add,
                                    //      null to exclude the key,
                                    //      undefined to keep it.
                                    //  - false: no header
                                    //  - array of strings: used as
                                    //      is for column names (keys
                                    //      not listed are omitted)

    adapter: {
        // Update the year property
        year: function(row) {       // An object containing callbacks for
            return row['year']-1;   // given csv column names. Callbacks take
        }                           // an object (a row of the csv file
    },                              // on load, or an item of the
                                    // database on save) and return a value to
                                    // be saved/loaded under that property name.

    separator: ',',                 // The character used as separator
                                    // between values. Default ','.

    quote: '"',                     // The character used as quote.
                                    // Default: '"'.


    escapeCharacter: '\\',          // The char that should be skipped.
                                    // Default: \. (load only)

    lineBreak: '\n',                // Line break character. Default: system's
                                    // default.

    bufferSize: 128 * 1024,         // Number of bytes to read at once.
                                    // Default: 128 * 1024.


    // SAVE ONLY.

    bool2num: true,                 // If TRUE, booleans are converted to 0/1.

    na: 'NA',                       // Value for missing fields. Default: 'NA'.

    objectLevel: 2,                 // For saving only, the level of nested
                                    // objects to expand into csv columns

    flatten: true,                  // If TRUE, it flattens all items
                                    // currently selected into one row.

    flattenByGroup: 'player',       // If set, there will be one row per unique
                                    // value of desired group (here: 'player')

    updatesOnly: true,              // If TRUE, saves only items that were
                                    // inserted into the database after
                                    // a file with the same name was last saved.

    updateDelay: 20000,             // Number of milliseconds to wait before
                                    // saving updates. Default: 10000.

}

Test

NDDB relies on mocha and should.js for testing.

$ npm install # will load all necessary dependencies
$ npm test # will run the test suite against nddb.js

Build

Create your customized build of NDDB using the make file in the bin directory:

In order to run in the browser NDDB needs to have JSUS loaded. You can include it separately, or create a new build that includes it already. See the build help for options.

node make build // Standard build
node make build -a -o nddb-full // Full build

The build file will be created inside the build/ directory.

License

MIT