
@rwightman
rwightman / _README_MobileNetV4.md
Last active December 3, 2025 19:35

MobileNetV4 Hparams

The included YAML files are timm train-script configs for the MobileNetV4 models in timm (weights on the HF Hub: https://huggingface.co/collections/timm/mobilenetv4-pretrained-weights-6669c22cda4db4244def9637).

Note the # of GPUs in each config; it must be taken into account for global batch size equivalence (global batch = per-GPU batch × # of GPUs) and for LR scaling.

Also note, some models have lr set to a non-null value; when set, that LR is used directly. Otherwise it falls back to lr_base, and the rate actually used is derived from lr_base_size with sqrt scaling by the global batch size.
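The fallback described above can be sketched as follows. The function name `resolve_lr` and the exact formula are assumptions based on this description, not a copy of timm's internal code:

```python
import math

def resolve_lr(lr, lr_base, lr_base_size, global_batch_size):
    """Hypothetical sketch of the LR resolution described above:
    use `lr` directly if set, otherwise sqrt-scale `lr_base` by the
    global batch size relative to `lr_base_size`."""
    if lr is not None:
        return lr
    return lr_base * math.sqrt(global_batch_size / lr_base_size)
```

For example, with lr_base=0.1, lr_base_size=4096 and a global batch of 1024, the scaled rate comes out to 0.1 × sqrt(1024/4096) = 0.05.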

Models with ix in the tag use an alternative init for the MQA attention projections: xavier (glorot) uniform instead of the efficientnet/mobilenet defaults. This seemed to improve stability of the hybrid models and allowed a larger (closer to 1) beta2 for adam; otherwise the adam beta2 or the LR had to be reduced to avoid instability with the hybrids.
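A minimal sketch of what such an ix-style init could look like, assuming the projections are plain Linear/Conv2d modules; the helper name `init_attn_projections_ix` is hypothetical, only `torch.nn.init.xavier_uniform_` is the real API:

```python
import torch.nn as nn

def init_attn_projections_ix(module):
    # Hypothetical sketch: apply xavier (glorot) uniform to projection
    # weights instead of the efficientnet/mobilenet-style defaults.
    if isinstance(module, (nn.Linear, nn.Conv2d)):
        nn.init.xavier_uniform_(module.weight)
        if module.bias is not None:
            nn.init.zeros_(module.bias)
```

It would be applied to an attention block with something like `attn_block.apply(init_attn_projections_ix)`.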

from graphviz import Digraph
import torch
from torch.autograd import Variable, Function

def iter_graph(root, callback):
    # Walk the autograd graph starting at `root` (a grad_fn node),
    # invoking `callback` on each node exactly once.
    queue = [root]
    seen = set()
    while queue:
        fn = queue.pop()
        if fn in seen:
            continue
        seen.add(fn)
        for next_fn, _ in fn.next_functions:
            if next_fn is not None:
                queue.append(next_fn)
        callback(fn)
@jednano
jednano / gitcom.md
Last active July 31, 2025 00:04
Common git commands in a day-to-day workflow

Git Cheat Sheet

Initial Setup

Create an empty git repo or reinitialize an existing one

git init