AI & ML interests
None yet
Organizations
None yet
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_reward_tampering
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_reward_tampering
4B • Updated • 1
yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-Semantic-ClipHigh-Ent0.000
8B • Updated • 1
yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL
8B • Updated • 1
• 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B_self_grading
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_reward_tampering
8B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_summarization
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_summarization
8B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_summarization
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B_reward_tampering
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_reward_tampering
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B_self_grading
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_summarization
4B • Updated • 2
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_llama_summarization
8B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_summarization
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_self_grading
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_self_grading
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_self_grading
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_self_grading
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_self_grading
4B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_summarization
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_summarization
8B • Updated • 2
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_llama_self_grading
8B • Updated • 1
yujunzhou/Advanced_Risk_Advanced_Risk_Situation_Aware_Qwen3-4B-Base_reward_tampering
4B • Updated • 1
yujunzhou/Advanced_Risk_Secure2_summarization_SFT_Advanced_Risk_Summarization_Qwen3_4B_Base
4B • Updated • 1
yujunzhou/Advanced_Risk_Secure2_summarization_Advanced_Risk_Summarization_Qwen3-4B-Base
Updated
yujunzhou/Corr_SFT_Reward_Tampering_llama
Text Generation
• 8B • Updated • 1
yujunzhou/Corr_SFT_Reward_Tampering_Qwen3-4B
Text Generation
• 4B • Updated • 1