Skip to content
View study8677's full-sized avatar

Block or report study8677

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
study8677/README.md

Hi, I'm Jingwen Fan (范敬文) 👋

AI/ML learner focusing on Large Language Models (LLMs).
Passionate about the full LLM lifecycle from pre-training & fine-tuning to alignment (RLHF) and inference optimization.


🚀 Current Focus

  • LLM Alignment: RLHF, PPO, DPO & Retrieval-Augmented Generation (RAG)
  • Learning: Agentic Workflows, DeepSpeed & model quantization (AWQ / GPTQ)

🎓 Background

  • Education: Qilu University of Technology (QLUT)
  • Research Interests: Reward modeling, context window extension, chain-of-thought (CoT)

🎯 Goals

  • Life goal: Stay curious, be brave, and live with kindness.

📟 最新文章

💌 联系方式

Pinned Loading

  1. antigravity-workspace-template antigravity-workspace-template Public

    🪐 The ultimate starter kit for Google Antigravity IDE. Optimized for Gemini 3 Agentic Workflows, "Deep Think" mode, and auto-configuring .cursorrules.

    Python 662 139

  2. easy_claude_code easy_claude_code Public

    Building on prior minimal implementations, this project explains the working principles of Claude Code with fewer core concepts.在前人极简实现的基础上,用更少的概念解释清楚 Claude Code 的工作原理。

    Python 1

  3. Engram Engram Public

    Forked from deepseek-ai/Engram

    Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

    Python

  4. PromptLint PromptLint Public

    PromptLint — Lint prompts for robustness across models and temperatures.

    Python 3 1

  5. verl verl Public

    Forked from volcengine/verl

    verl: Volcano Engine Reinforcement Learning for LLMs

    Python 1

  6. IA-CNN IA-CNN Public

    基于IA-CNN的不平衡数据分类算法----一种新的不平衡数据处理方法

    Python 2