·
AI & ML interests
None yet
Organizations
models 18
bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.4
Text Generation
• 1B • Updated
• 5
• bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.3
Text Generation
• 1B • Updated
• 5
• bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.1
Text Generation
• 1B • Updated
• 6
• bikalnetomi/RLHF-PPO-PPOModel-LLama3-1B-v1.0
Text Generation
• 1B • Updated
• 5
• bikalnetomi/RLHF-PPO-RewardModel-LLama3-1B-v1
Text Classification
• 1B • Updated
bikalnetomi/RLHF-PPO-RewardModel-LLama3-1B-v2
Updated
bikalnetomi/rlhf-ppo-llama3-1B-Reward-model-lora-bikal
Updated
bikalnetomi/RLHF-PPO-RewardModel-LLama3-3B-v2
Text Classification
• 3B • Updated
bikalnetomi/RLHF-PPO-RewardModel-LLama3-1B-v1.1
Text Classification
• 1B • Updated
• 2
bikalnetomi/RLHF-PPO-RewardModel-LLama3-3B-v1
Text Generation
• Updated
• 1