openai gpt-oss: gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Additionally we are providing a reference implementation for Metal to run on Apple Silicon. This version can be run on a single 80GB GPU for gpt-oss-120b. To run this implementation, the tenobet nightly version of triton and torch will be installed. It also has some optimization on the attention code to reduce the memory cost. […]

