imoneoi

🎯

Tuning PPO

One imoneoi

Professional RL(HF) hyperparameter tuner

276 followers · 3 following

http://imone.me

Lists (1)

🔮 Future ideas

Stars

ruixiangcui / AGIEval

Python 658 43 Updated Jun 13, 2024

AmericanPresidentJimmyCarter / test-torch-bfloat16-vit-training

Python 7 Updated Apr 4, 2024

xai-org / grok-1

Grok open release

Python 49,052 8,308 Updated May 29, 2024

edornd / argdantic

Typed command line interfaces with argparse and pydantic

Python 37 4 Updated Feb 26, 2024

openchatdev / cublas_sm90_grouped_gemm

Forked from tgale96/grouped_gemm

[For SM90 and cuBLAS] PyTorch bindings for CUTLASS grouped GEMM.

Cuda 1 Updated Jan 24, 2024

Codium-ai / AlphaCodium

Official implementation for the paper: "Code Generation with AlphaCodium: From Prompt Engineering to Flow Engineering"

Python 3,260 231 Updated May 17, 2024

foldl / chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

C++ 247 17 Updated Jun 15, 2024

SupImDos / pydantic-argparse

Typed Argument Parsing with Pydantic

Python 87 16 Updated May 24, 2024

imoneoi / cutlass_grouped_gemm

Forked from tgale96/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 3 1 Updated Dec 27, 2023

NVIDIA / open-gpu-kernel-modules

NVIDIA Linux open GPU kernel module source

C 14,131 1,161 Updated Jun 6, 2024

microsoft / Table-Pretraining

ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor

Python 281 40 Updated Feb 6, 2023

sail-sg / symbolic-instruction-tuning

The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".

Python 60 3 Updated Apr 18, 2023

Sanster / padding_free_llm_train

Python 8 1 Updated Feb 6, 2024

OpenOrca / FLAN_OO2

Forked from google-research/FLAN

Python 1 Updated Nov 22, 2023

zhoubolei / bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

1,287 118 Updated May 9, 2023

ray-project / ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 31,723 5,390 Updated Jun 16, 2024

imoneoi / openchat

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,089 383 Updated May 24, 2024

imoneoi / mistral-tokenizer

JavaScript 17 5 Updated Apr 1, 2024

imoneoi / EvolvingConnectivity

Code for paper Evolving Connectivity for Spiking Neural Networks

Python 10 Updated Oct 23, 2023

SciPhi-AI / synthesizer

A multi-purpose LLM framework for RAG and data creation.

Python 599 48 Updated Jan 13, 2024

h44z / wg-portal

WireGuard Configuration Portal with LDAP connection

Go 868 120 Updated Jun 10, 2024

lilacai / lilac

Curate better data for LLMs

Python 874 78 Updated Mar 19, 2024

OpenAccess-AI-Collective / axolotl

Go ahead and axolotl questions

Python 6,586 731 Updated Jun 12, 2024

jannerm / trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"

Python 438 61 Updated Oct 6, 2022

imoneoi / d4rl

Forked from ZhengyaoJiang/d4rl

A benchmark for offline reinforcement learning.

Python 1 Updated Sep 28, 2023

InternLM / lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 2,969 257 Updated Jun 15, 2024

fattorib / ZeRO-transformer

Two implementations of ZeRO-1 optimizer sharding in JAX

Python 10 Updated Jun 11, 2023

meta-math / MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 341 30 Updated Feb 1, 2024

jd / tenacity

Retrying library for Python

Python 6,173 268 Updated Jun 13, 2024

shobrook / openlimit

Maximize your usage of OpenAI models without hitting rate limits

Python 127 19 Updated Jun 5, 2024