docs: add Ollama guide under 08_ml_ai_hpc #258

Open
Amanda-dong wants to merge 19 commits into NYU-RTS:main from Amanda-dong:main

@Amanda-dong
Contributor

I added the Ollama page under 08_ml_ai_hpc since it's an AI tool, but let me know if you'd prefer it under 06_tools_and_software to match other software guides.

Added comprehensive documentation for Ollama, a command-line tool for running large language models, including installation instructions, environment variables, and usage examples.
Added instructions for starting interactive Ollama sessions on a GPU node.
Added instructions for starting Ollama server and chatting with LLMs in interactive sessions.
Updated installation instructions and storage recommendations for Ollama.
Updated the document to change the tool's name and description from 'Ollama' to 'vLLM'.
Added performance comparison of vLLM and llama-cpp on Torch, including throughput and latency metrics.
Updated median latency format for vLLM in the table.
Added section on SGLang for offline batch inference and linked to documentation.
Updated section title for clarity and added context about SGLang.