-
Notifications
You must be signed in to change notification settings - Fork 521
Pull requests: pytorch/FBGEMM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Profile with kineto and warmup for more accurate benchmarking (#3580)
cla signed
fb-exported
#3585
opened Jan 17, 2025 by
q10
Loading…
Support INT4 Dequant onto GPU for Seq INT TBE look up
cla signed
fb-exported
#3584
opened Jan 17, 2025 by
faran928
Loading…
Add support for
int32_t
indices in TBE training (2K/N)
cla signed
fb-exported
module: rocm
#3583
opened Jan 16, 2025 by
q10
Loading…
Profile with kineto and warmup for more accurate benchmarking
cla signed
#3580
opened Jan 16, 2025 by
amirakb89
Loading…
Updates and fixes to tensor_accessor.h
cla signed
fb-exported
module: rocm
#3571
opened Jan 14, 2025 by
q10
Loading…
Unifying TBE API using List (Backend)
cla signed
fb-exported
#3563
opened Jan 11, 2025 by
spcyppt
Loading…
Refactor FP8 grouped GEMM with dynamic and static versions
cla signed
fb-exported
#3561
opened Jan 10, 2025 by
jiawenliu64
Loading…
Support FP8 grouped GEMM with rowwise scailing
cla signed
fb-exported
#3560
opened Jan 10, 2025 by
jiawenliu64
Loading…
Add support for
int32_t
indices in TBE training (2I/N)
cla signed
fb-exported
module: rocm
#3556
opened Jan 7, 2025 by
q10
Loading…
Switch dynamic FP8 grouped gemm to accept tensor inputs
cla signed
fb-exported
#3552
opened Jan 6, 2025 by
jwfromm
Loading…
Add support for
int32_t
indices in TBE training (2H/N)
cla signed
fb-exported
module: rocm
#3539
opened Jan 3, 2025 by
q10
Loading…
env variable to select rounding mode
cla signed
fb-exported
#3515
opened Dec 19, 2024 by
hhyuanf
Loading…
Back out "Manual loop unroll for rocm inference"
ciflow/rocm
cla signed
fb-exported
module: rocm
#3506
opened Dec 15, 2024 by
brad-mengchi
Loading…
migrate "jagged_flash_attention"
cla signed
fb-exported
#3490
opened Dec 10, 2024 by
brad-mengchi
Loading…
Optimzed backward pass for ROCm devices (#3367)
ciflow/rocm
cla signed
fb-exported
module: rocm
#3468
opened Dec 6, 2024 by
q10
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.