DeepSeek

Path:/datasets/ai/deepseek
URL:https://huggingface.co/deepseek-ai
Downloaded:2025-02-10
Cite:Guo, Daya, et al. “Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.” arXiv preprint arXiv:2501.12948 (2025) Liu, Aixin, et al. “Deepseek-v3 technical report.” arXiv preprint arXiv:2412.19437 (2024).
Variant:
  • DeepSeek-R1
  • DeepSeek-R1-Distill-Llama-70B
  • DeepSeek-R1-Distill-Llama-8B
  • DeepSeek-R1-Distill-Qwen-1.5B
  • DeepSeek-R1-Distill-Qwen-14B
  • DeepSeek-R1-Distill-Qwen-32B
  • DeepSeek-R1-Distill-Qwen-7B
  • DeepSeek-R1-Zero
  • DeepSeek-V3
  • DeepSeek-V3-Base
  • Janus-Pro-7B
  • deepseek-coder-1.3b-instruct
  • deepseek-coder-33b-instruct
  • deepseek-coder-6.7b-base
  • deepseek-coder-6.7b-instruct
  • deepseek-math-7b-instruct
Bibtex:
@article{guo2025deepseek,4 title={Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning}, author={Guo, Daya and Yang, Dejian and Zhang, Haowei and Song, Junxiao and Zhang, Ruoyu and Xu, Runxin and Zhu, Qihao and Ma, Shirong and Wang, Peiyi and Bi, Xiao and others}, journal={arXiv preprint arXiv:2501.12948}, year={2025} } @article{liu2024deepseek, title={Deepseek-v3 technical report}, author={Liu, Aixin and Feng, Bei and Xue, Bing and Wang, Bingxuan and Wu, Bochao and Lu, Chengda and Zhao, Chenggang and Deng, Chengqi and Zhang, Chenyu and Ruan, Chong and others}, journal={arXiv preprint arXiv:2412.19437}, year={2024} }