Skip to main content
SFTTrainer Source Code Exploration: Prepare Train

SFTTrainer Source Code Exploration: Prepare Train

  • Prepare Train Overall Logic
  • Prepare Train Code Details
    • _inner_training_loop
    • training_step
    • compute_loss
    • PeftModelForCausalLM.forward
    • Linear4bit.forward

LizAbout 5 minLLMSFTTrainerSource CodePrepare Train
SFTTrainer Source Code Exploration: Prepare Dataset

SFTTrainer Source Code Exploration: Prepare Dataset

  • Prepare Dataset Overall Logic
  • Prepare Dataset Code Details
    • SFTTrainer.init
    • DataCollatorForLanguageModeling
    • _prepare_dataset

LizAbout 3 minLLMSFTTrainerSource CodePrepare Dataset
SFTTrainer Source Code Exploration: Prepare Model

SFTTrainer Source Code Exploration: Prepare Model

  • Prepare Model Overall Logic
  • Prepare Model Code Details
    • _prepare_peft_model
    • PeftModelForCausalLM.init
    • PeftModel.init
    • LoraModel.init
    • Linear4bit.init
    • LoraLayer.init(self, base_layer)

LizAbout 5 minLLMSFTTrainerSource CodePrepare Model
Llama Source Code Exploration

Llama Source Code Exploration

  • Llama Overall Architecture
  • Llama Code Logic
  • Tensor Dimensionality Transformation
  • Number of Trainable Parameters

LizAbout 7 minLLMLlamaSource Code
Transformer Source Code Exploration

Transformer Source Code Exploration

  • Transformer Overall Architecture
  • Hyperparameters
  • Tensor Dimensionality Transformation
  • Number of Trainable Parameters

LizAbout 16 minLLMTransformerSource Code