Skip to content
View imoneoi's full-sized avatar
🎯
Tuning PPO
🎯
Tuning PPO

Organizations

@OpenOrca @FastEval
Block or Report

Block or report imoneoi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results
Python 658 43 Updated Jun 13, 2024

Grok open release

Python 49,047 8,308 Updated May 29, 2024

Typed command line interfaces with argparse and pydantic

Python 36 4 Updated Feb 26, 2024

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering""

Python 3,259 231 Updated May 17, 2024

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 247 17 Updated Jun 15, 2024

Typed Argument Parsing with Pydantic

Python 87 16 Updated May 24, 2024

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 3 1 Updated Dec 27, 2023

NVIDIA Linux open GPU kernel module source

C 14,130 1,161 Updated Jun 6, 2024

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 281 40 Updated Feb 6, 2023

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 60 3 Updated Apr 18, 2023
Python 1 Updated Nov 22, 2023

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,287 118 Updated May 9, 2023

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 31,721 5,389 Updated Jun 15, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,088 383 Updated May 24, 2024
JavaScript 17 5 Updated Apr 1, 2024

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 10 Updated Oct 23, 2023

A multi-purpose LLM framework for RAG and data creation.

Python 599 48 Updated Jan 13, 2024

WireGuard Configuration Portal with LDAP connection

Go 867 120 Updated Jun 10, 2024

Curate better data for LLMs

Python 874 78 Updated Mar 19, 2024

Go ahead and axolotl questions

Python 6,582 730 Updated Jun 12, 2024

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Python 438 61 Updated Oct 6, 2022

A benchmark for offline reinforcement learning.

Python 1 Updated Sep 28, 2023

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 2,969 257 Updated Jun 15, 2024

Two implementations of ZeRO-1 optimizer sharding in JAX

Python 10 Updated Jun 11, 2023

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 340 30 Updated Feb 1, 2024

Retrying library for Python

Python 6,171 268 Updated Jun 13, 2024

Maximize your usage of OpenAI models without hitting rate limits

Python 127 19 Updated Jun 5, 2024
Next