PrepareB but take integers instead of float #42
Prepare B if B is quantized and transposed; Prepare B if B is transposed. I think we can merge them into master first and then try some optimizations.
Ooh
Merged prepare-b-quantized-transposed in 03a4a9d.
We also need a PrepareB for the case where B is only quantized (not transposed).
Also, as a slight enhancement, it would be nice (and probably more important from a performance point of view) to have transpose and Quantize variants for PrepareA. The affine and dot operators take …
So we need all the combinations?
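For concreteness, here is a hedged sketch of what that set of entry points might look like, following the style of the existing PrepareB and Quantize. Apart from PrepareB, PrepareA, Quantize, and PrepareBQuantizedTransposed (the merged branch above), the names are hypothetical, and the parameter lists and the Index type are guesses rather than the actual API:

```cpp
#include <cstdint>

using Index = unsigned int;  // stand-in for the library's index type (assumption)

// Hypothetical API sketch, not the actual header.
struct Int8Sketch {
  // Existing: float B in, quantize and rearrange into the register-length-dependent layout.
  static void PrepareB(const float *input, int8_t *output, float quant_mult, Index rows, Index cols);
  // B is float but already transposed: quantize + rearrange.
  static void PrepareBTransposed(const float *input, int8_t *output, float quant_mult, Index rows, Index cols);
  // B is already int8 and transposed (the merged prepare-b-quantized-transposed case): rearrange only.
  static void PrepareBQuantizedTransposed(const int8_t *input, int8_t *output, Index rows, Index cols);
  // B is already int8, row major: rearrange only -- the missing combination this issue asks for.
  static void PrepareBQuantized(const int8_t *input, int8_t *output, Index rows, Index cols);
  // A side: the same matrix of variants (transposed and/or pre-quantized) could mirror the
  // B cases above, as suggested for PrepareA.
  static void PrepareA(const float *input, int8_t *output, float quant_mult, Index rows, Index cols);
};
```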
The current PrepareB function combines quantization and rearrangement. The rearrangement depends on the register length. We're going to want to distribute int8 models in an architecture-independent fashion (probably as row major) and then have them rearranged at load time. The Quantize function already converts to int8 format without rearranging. So what's needed is an int8 rearrangement function.
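As a sketch of the intended flow, assuming a hypothetical PrepareBQuantized like the one above (the name and signature are illustrative, not the actual API): quantize once offline, ship the row-major int8 weights, and only run the cheap integer rearrangement at load time.

```cpp
#include <cstdint>
#include <vector>

using Index = unsigned int;  // assumption, as above

// Hypothetical rearrangement-only entry point: row-major int8 in,
// register-length-dependent prepared layout out.
void PrepareBQuantized(const int8_t *row_major_in, int8_t *prepared_out, Index rows, Index cols);

// Load path: the model file stores B quantized once (offline, via Quantize) in plain
// row-major order, so the same file works regardless of SSE/AVX2/AVX512 register width.
std::vector<int8_t> LoadPreparedB(const std::vector<int8_t> &row_major_b, Index rows, Index cols) {
  std::vector<int8_t> prepared(row_major_b.size());
  // No float pass at load time; only the integer shuffle into the local layout.
  PrepareBQuantized(row_major_b.data(), prepared.data(), rows, cols);
  return prepared;
}
```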
Possibly with a preprocessing template, though that sounds complicated.
It's also worth considering whether this should be done in place or by copying.
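Two hypothetical shapes for that choice (names and parameters are again illustrative): a copying version is simpler and keeps the row-major input intact at the cost of a second B-sized buffer, while an in-place version saves the allocation but has to apply the layout permutation without clobbering elements that haven't been read yet, which is harder to vectorize.

```cpp
#include <cstdint>

using Index = unsigned int;  // assumption, as above

// Copying: needs a second buffer the size of B, but the loop is a straightforward gather/scatter.
void PrepareBQuantized(const int8_t *row_major_in, int8_t *prepared_out, Index rows, Index cols);

// In-place: no extra allocation; the permutation must be applied cycle by cycle
// or via a small scratch tile.
void PrepareBQuantizedInPlace(int8_t *b, Index rows, Index cols);
```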