Mark Rogers markrogersjr

## gpt_oss_flash.md

      
        
          
            
              
            
            
Last active December 25, 2025 23:22
            
              
GPT-OSS with Flash Attention and Memory-Efficient Attention via PyTorch-Native SDPA

from time import time
import warnings

import torch
import transformers