Skip to content

Latest commit

 

History

History
298 lines (195 loc) · 17.6 KB

README.md

File metadata and controls

298 lines (195 loc) · 17.6 KB

TagStudio (Alpha): A User-Focused Document Management System

Caution

This is still a very rough personal project of mine in its infancy. I’m open-sourcing it now in order to accept contributors sooner and to better facilitate the direction of the project from an earlier stage. There are bugs, and there will very likely be breaking changes!

TagStudio is a photo & file organization application with an underlying system that focuses on giving freedom and flexibility to the user. No proprietary programs or formats, no sea of sidecar files, and no complete upheaval of your filesystem structure.

Contents

Goals

  • To achieve a portable, privacy-oriented, open, extensible, and feature-rich system of organizing and rediscovering files.
  • To provide powerful methods for organization, notably the concept of tag composition, or “taggable tags”.
  • To create an implementation of such a system that is resilient against a user’s actions outside the program (modifying, moving, or renaming files) while also not burdening the user with mandatory sidecar files or otherwise requiring them to change their existing file structures and workflows.
  • To support a wide range of users spanning across different platforms, multi-user setups, and those with large (several terabyte) libraries.
  • To make the darn thing look like nice, too. It’s 2024, not 1994.

Priorities

  1. The concept. Even if TagStudio as a project or application fails, I’d hope that the idea lives on in a superior project. The goals outlined above don’t reference TagStudio once - TagStudio is what references the goals.
  2. The system. Frontends and implementations can vary, as they should. The core underlying metadata management system is what should be interoperable between different frontends, programs, and operating systems. A standard implementation for this should settle as development continues. This opens up the doors for improved and varied clients, integration with third-party applications, and more.
  3. The application. If nothing else, TagStudio the application serves as the first (and so far only) implementation for this system of metadata management. This has the responsibility of doing the idea justice and showing just what’s possible when it comes to user file management.
  4. (The name.) I think it’s fine for an app or client, but it doesn’t really make sense for a system or standard. I suppose this will evolve with time.

Current Features

  • Create libraries/vaults centered around a system directory. Libraries contain a series of entries: the representations of your files combined with metadata fields. Each entry represents a file in your library’s directory, and is linked to its location.
  • Add metadata to your library entries, including:
    • Name, Author, Artist (Single-Line Text Fields)
    • Description, Notes (Multiline Text Fields)
    • Tags, Meta Tags, Content Tags (Tag Boxes)
  • Create rich tags composed of a name, a list of aliases, and a list of “subtags” - being tags in which these tags inherit values from.
  • Search for entries based on tags, metadata (TBA), or filenames/filetypes (using filename: <query>)
  • Special search conditions for entries that are: untagged/no tags and empty/no fields.

Note

For more information on the project itself, please see the FAQ section and other docs.

Installation

To download TagStudio, visit the Releases section of the GitHub repository and download the latest release for your system. TagStudio is available for Windows, macOS (Apple Silicon & Intel), and Linux. Windows and Linux builds are also available in portable versions if you want a more self-contained executable to move around.

Note

On macOS, you may be met with a message saying ""TagStudio" can't be opened because Apple cannot check it for malicious software." If you encounter this, then you'll need to go to the "Settings" app, navigate to "Privacy & Security", and scroll down to a section that says ""TagStudio" was blocked from use because it is not from an identified developer." Click the "Open Anyway" button to allow TagStudio to run. You should only have to do this once after downloading the application.

Optional Arguments

Optional arguments to pass to the program.

--open <path> / -o <path> Path to a TagStudio Library folder to open on start.

--config-file <path> / -c <path> Path to the TagStudio config file to load.

Usage

Creating/Opening a Library

With TagStudio opened, start by creating a new library or opening an existing one using File -> Open/Create Library from the menu bar. TagStudio will automatically create a new library from the chosen directory if one does not already exist. Upon creating a new library, TagStudio will automatically scan your folders for files and add those to your library (no files are moved during this process!).

Refreshing the Library

In order to scan for new files or file changes, you’ll need to manually go to File -> Refresh Directories.

Note

In the future, library refreshing will also be automatically done in the background, or additionally on app startup.

Adding Metadata to Entries

To add a metadata field to a file entry, start by clicking the “Add Field” button under the file preview in the right-hand preview panel. From the dropdown menu, select the type of metadata field you’d like to add to the entry.

Editing Metadata Fields

Text Line / Text Box

Hover over the field and click the pencil icon. From there, add or edit text in the dialog box popup.

Tag Box

Click the “+” button at the end of the Tags list, and search for tags to add inside the new dialog popup. Click the “+” button next to whichever tags you want to add. Alternatively, after you search for a tag, press the Enter/Return key to add the add the first item in the list. Press Enter/Return once more to close the dialog box

Warning

Keyboard control and navigation is currently very buggy, but will be improved in future versions.

Creating Tags

To create a new tag, click on Edit -> New Tag from the menu bar. From there, enter a tag name, shorthand name, any tag aliases separated by newlines, any subtags, and an optional color.

  • The tag shorthand is a type of alias that displays in situations when screen space is more valuable (ex. as a subtag for other tags).
  • Aliases are alternate names for a tag. These let you search for terms other than the exact tag name in order to find the tag again.
  • Subtags are tags in which this tag is a child tag of. In other words, tags under this section are parents of this tag. For example, if you had a tag for a character from a show, you would make the show a subtag of this character. This would display as “Character (Show)” in most areas of the app. The first tag in this list is used as the tag shown in parentheses for specification.
  • The color dropdown lets you select an optional color for this tag to display as.

Editing Tags

To edit a tag, right-click the tag in the tag field of the preview pane and select “Edit Tag”

Warning

There is currently no method to view all tags that you’ve created in your library. This is a top priority for future releases.

Relinking Renamed/Moved Files

Inevitably, some of the files inside your library will be renamed, moved, or deleted. If a file has been renamed or moved, TagStudio will display the thumbnail as a red tag with a cross through it (this icon is also used for items with broken thumbnails). To relink moved files or delete these entries, go to Tools -> Manage Unlinked Entries. Click the “Refresh” button to scan your library for unlinked entries. Once complete, you can attempt to “Search & Relink” any unlinked entries to their respective files, or “Delete Unlinked Entries” in the event the original files have been deleted and you no longer wish to keep their metadata entries inside your library.

Warning

There is currently no method to relink entries to files that have been renamed - only moved or deleted. This is a top priority for future releases.

Warning

If multiple matches for a moved file are found (matches are currently defined as files with a matching filename as the original), TagStudio will currently ignore the match groups. Adding a GUI for manual selection, as well as smarter automated relinking, are top priorities for future versions.

Saving the Library

Libraries are saved upon exiting the program. To manually save, select File -> Save Library from the menu bar. To save a backup of your library, select File -> Save Library Backup from the menu bar.

Half-Implemented Features

Fix Duplicate Files

Load in a .dupeguru file generated by dupeGuru and mirror metadata across entries marked as duplicates. After mirroring, return to dupeGuru to manage deletion of the duplicate files. After deletion, use the “Fix Unlinked Entries” feature in TagStudio to delete the duplicate set of entries for the now-deleted files

Caution

While this feature is functional, it’s a pretty roundabout process and can be streamlined in the future.

Image Collage

Create an image collage of your photos and videos.

Caution

Collage sizes and options are hardcoded.

Macros

Apply tags and other metadata automatically depending on certain criteria. Set specific macros to run when the files are added to the library. Part of this includes applying tags automatically based on parent folders.

Caution

Macro options are hardcoded, and there’s currently no way for the user to interface with this (still incomplete) system at all.

Gallery-dl Sidecar Importing

Import JSON sidecar data generated by gallery-dl.

Caution

This feature is not supported or documented in any official capacity whatsoever. It will likely be rolled-in to a larger and more generalized sidecar importing feature in the future.

Creating a Development Environment

If you're interested in contributing to TagStudio or just wish to poke around the live codebase, here are instructions for setting up the Python project.

Prerequisites

  • Python 3.12

Creating a Python Virtual Environment

Note

Depending on your system, Python may be called python, py, python3, or py3. These instructions use the alias python3. You can check to see which alias you system uses and if it's for the correct Python version by typing python3 --version (or whichever alias) into your terminal.

Skip these steps if launching from the .sh script on Linux/macOS.

  1. In the root repository directory, create a python virtual environment:
    python3 -m venv .venv
  2. Activate your environment:
  • Windows w/Powershell: .venv\Scripts\Activate.ps1
  • Windows w/Command Prompt: .venv\Scripts\activate.bat
  • Linux/macOS: source .venv/bin/activate
  1. Install the required packages:
  • required to run the app: pip install -r requirements.txt
  • required to develop: pip install -r requirements-dev.txt

To run all the tests use python -m pytest tests/ from the tagstudio folder.

Learn more about setting up a virtual environment here.

Launching Development Environment

  • Windows (start_win.bat)

    • To launch TagStudio, launch the start_win.bat file. You can modify this .bat file or create a shortcut and add one or more additional arguments if desired.
  • Linux/macOS (TagStudio.sh)

    • Run the "TagStudio.sh" script, and the program should launch! (Make sure that the script is marked as executable if on Linux). Note that launching from the script from outside of a terminal will not launch a terminal window with any debug or crash information. If you wish to see this information, just launch the shell script directly from your terminal with ./TagStudio.sh.

    • NixOS (TagStudio.sh)

      • Use the provided flake.nix file to create and enter a working environment by running nix develop. Then, run the TagStudio.sh script.
  • Any (No Scripts)

    • Alternatively, with the virtual environment loaded, run the python file at tagstudio\tag_studio.py from your terminal. If you're in the project's root directory, simply run python3 tagstudio/tag_studio.py.

FAQ

What State Is the Project Currently In?

As of writing (Alpha v9.2.0) the project is in a useable state, however it lacks proper testing and quality of life features.

What Features Are You Planning on Adding?

Of the several features I have planned for the project, these are broken up into “priority” features and “future” features. Priority features were originally intended for the first public release, however are currently absent from the Alpha v9.x.x builds.

Priority Features

  • Improved search
    • Sortable Search
    • Boolean Search
    • Coexisting Text + Tag Search
    • Searchable File Metadata
  • Comprehensive Tag management tab
  • Easier ways to apply tags in bulk
    • Tag Search Panel
    • Recent Tags Panel
    • Top Tags Panel
    • Pinned Tags Panel
  • Better (stable, performant) library grid view
  • Improved entry relinking
  • Cached thumbnails
  • Tag-like Groups
  • Resizable thumbnail grid
  • User-defined metadata fields
  • Multiple directory support
  • SQLite (or similar) save files
  • Reading of EXIF and XMP fields
  • Improved UI/UX
  • Better internal API for accessing Entries, Tags, Fields, etc. from the library.
  • Proper testing workflow
  • Continued code cleanup and modularization
  • Exportable/importable library data including "Tag Packs"

Future Features

  • Support for multiple simultaneous users/clients
  • Draggable files outside the program
  • Comprehensive filetype whitelist
  • A finished “macro system” for automatic tagging based on predetermined criteria.
  • Different library views
  • Date and time fields
  • Entry linking/referencing
  • Audio waveform previews
  • 3D object previews
  • Additional previews for miscellaneous file types
  • Optional global tags and settings, spanning across libraries
  • Importing & exporting libraries to/from other programs
  • Port to a more performant language and modern frontend (Rust?, Tauri?, etc.)
  • Plugin system
  • Local OCR search
  • Support for local machine learning-based tag suggestions for images
  • Mobile version (FAR future)

Features I Likely Won’t Add/Pull

  • Native Cloud Integration
    • There are plenty of services already (native or third-party) that allow you to mount your cloud drives as virtual drives on your system. Pointing TagStudio to one of these mounts should function similarly to what native integration would look like.
  • Native ChatGPT/Non-Local LLM Integration
    • This could mean different things depending on what you're intending. Whether it's trying to use an LLM to replace the native search, or to trying to use a model for image recognition, I'm not interested in hooking people's TagStudio libraries into non-local LLMs such as ChatGPT and/or turn the program into a "chatbot" interface (see: Goals/Privacy). I wouldn't, however, mind using locally hosted models to provide the optional ability for additional searching and tagging methods (especially when it comes to facial recognition).

Why Is the Version Already v9?

I’ve been developing this project over several years in private, and have gone through several major iterations and rewrites in that time. This “major version” is just a number at the end of the day, and if I wanted to I couldn’t released this as “Version 0” or “Version 1.0”, but I’ve decided to stick to my original version numbers to avoid needing to go in and change existing documentation and code comments. Version 10 is intended to include all of the “Priority Features” I’ve outlined in the previous section. I’ve also labeled this version as an Alpha, and will likely reset the numbers when a feature-complete beta is reached.

Wait, Is There a CLI Version?

As of right now, no. However, I did have a CLI version in the recent past before dedicating my efforts to the Qt GUI version. I’ve left in the currently-inoperable CLI code just in case anyone was curious about it. Also yes, it’s just a bunch of glorified print statements (the outlook for some form of curses on Windows didn’t look great at the time, and I just needed a driver for the newly refactored code...).

Can I Contribute?

Yes!! I recommend taking a look at the Priority Features, Future Features, and Features I Won't Pull lists, as well as the project issues to see what’s currently being worked on. Please do not submit pull requests with new feature additions without opening up an issue with a feature request first.

Code formatting is automatically checked via Ruff.

To format the code manually, install ruff via pip install -r requirements-dev.txt and then run ruff format

To format the code automatically before each commit, there's a configured action available for pre-commit hook. Install it by running pre-commit install. The hook will be executed each time on running git commit.

More structured documentation on contribution requirements is on its way, but for now:

  • Use pathlib in favor of os.path
  • Try to make new UI additions match the existing style of the application