Skip to content
Change the repository type filter

All

    Repositories list

    • AMD's graph optimization engine.
      C++
      MIT License
      8819435059Updated Jan 10, 2025Jan 10, 2025
    • ROCm

      Public
      AMD ROCm™ Software - GitHub Home
      Shell
      MIT License
      3954.8k11215Updated Jan 10, 2025Jan 10, 2025
    • triton

      Public
      Development repository for the Triton language and compiler
      C++
      MIT License
      1.7k102945Updated Jan 10, 2025Jan 10, 2025
    • pytorch

      Public
      Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k2207639Updated Jan 10, 2025Jan 10, 2025
    • aotriton

      Public
      Ahead of Time (AOT) Triton Math Library
      Python
      MIT License
      1748101Updated Jan 10, 2025Jan 10, 2025
    • MIOpen

      Public
      AMD's Machine Intelligence Library
      Assembly
      Other
      2351.1k24860Updated Jan 10, 2025Jan 10, 2025
    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.1k53116Updated Jan 10, 2025Jan 10, 2025
    • hipBLASLt

      Public
      hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
      Assembly
      MIT License
      9771764Updated Jan 10, 2025Jan 10, 2025
    • ONNX Runtime: cross-platform, high performance scoring engine for ML models
      C++
      MIT License
      3k606Updated Jan 10, 2025Jan 10, 2025
    • C
      MIT License
      111413Updated Jan 10, 2025Jan 10, 2025
    • 🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
      Python
      Apache License 2.0
      28k413Updated Jan 10, 2025Jan 10, 2025
    • ROCm Documentation Python package for ReadTheDocs build standardization
      CSS
      Other
      171363Updated Jan 10, 2025Jan 10, 2025
    • This is the AMD-maintained fork of the LLVM git repository. This repository accepts pull requests and issues related to AMD fork-specific topics (amd/*). For all other issues/PRs, please submit upstream at https://github.com/llvm/llvm-project.
      LLVM
      Other
      12k1272411Updated Jan 10, 2025Jan 10, 2025
    • rocBLAS

      Public
      Next generation BLAS implementation for ROCm platform
      C++
      Other
      17035451Updated Jan 10, 2025Jan 10, 2025
    • rocJPEG

      Public
      rocJPEG is a high-performance jpeg decode SDK for decoding jpeg images using a hardware-accelerated jpeg decoder on AMD’s GPUs.
      C++
      MIT License
      8310Updated Jan 10, 2025Jan 10, 2025
    • rocDecode

      Public
      rocDecode is a high performance video decode SDK for AMD hardware
      C++
      Other
      172130Updated Jan 10, 2025Jan 10, 2025
    • MIVisionX

      Public
      MIVisionX toolkit is a set of comprehensive computer vision and machine intelligence libraries, utilities, and applications bundled into a single toolkit. AMD MIVisionX also delivers a highly optimized open-source implementation of the Khronos OpenVX™ and OpenVX™ Extensions.
      C++
      MIT License
      7519090Updated Jan 10, 2025Jan 10, 2025
    • Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
      C++
      Other
      1393312452Updated Jan 10, 2025Jan 10, 2025
    • rocSOLVER

      Public
      Next generation LAPACK implementation for ROCm platform
      C++
      Other
      5397016Updated Jan 10, 2025Jan 10, 2025
    • Libraries integrating migraphx with pytorch
      Python
      BSD 3-Clause "New" or "Revised" License
      26134Updated Jan 10, 2025Jan 10, 2025
    • hipSOLVER

      Public
      ROCm SOLVER marshalling library
      C++
      MIT License
      252402Updated Jan 10, 2025Jan 10, 2025
    • Cluster networking documentation for AMD Instinct accelerators
      MIT License
      4302Updated Jan 10, 2025Jan 10, 2025
    • HIP

      Public
      HIP: C++ Heterogeneous-Compute Interface for Portability
      C++
      MIT License
      5443.8k2437Updated Jan 10, 2025Jan 10, 2025
    • omnitrace

      Public
      Omnitrace: Application Profiling, Tracing, and Analysis
      C++
      MIT License
      27306146Updated Jan 10, 2025Jan 10, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      1.4k1512412Updated Jan 10, 2025Jan 10, 2025
    • HIP Python Low-level Bindings
      Shell
      MIT License
      31721Updated Jan 10, 2025Jan 10, 2025
    • xformers

      Public
      Hackable and optimized Transformers building blocks, supporting a composable construction.
      Python
      Other
      6342283Updated Jan 10, 2025Jan 10, 2025
    • ROCm Systems Profiler
      C++
      MIT License
      61304Updated Jan 10, 2025Jan 10, 2025
    • TransferBench is a utility capable of benchmarking simultaneous copies between user-specified devices (CPUs/GPUs)
      C++
      MIT License
      143803Updated Jan 10, 2025Jan 10, 2025
    • rccl

      Public
      ROCm Communication Collectives Library (RCCL)
      C++
      Other
      1282871218Updated Jan 10, 2025Jan 10, 2025