index

SYMBOLS

<i>Dolma\

A

attention mechanisms

B

C

classification

D

datasets

dropout

E

F

fine-tuning

G

I

K

L

M

N

neural networks

O

P

PyTorch

parameters

Q

R

S

supervised instruction fine-tuning

T

training function

training, optimizing performance with GPUs

U

V

W

weights

X

← Previous Section 22 of 22 Next →