docs: add Ollama guide under 08_ml_ai_hpc by Amanda-dong · Pull Request #258 · NYU-RTS/rts-docs

Amanda-dong · 2026-01-07T01:56:08Z

I added the Ollama page under 08_ml_ai_hpc since it's an AI tool, but let me know if you'd prefer it under 06_tools_and_software to match other software guides.

Added comprehensive documentation for Ollama, a command line tool for running large language models, including installation instructions, environment variables, and usage examples.

Added instructions for starting interactive Ollama sessions on a GPU node.

Added instructions for starting Ollama server and chatting with LLMs in interactive sessions.

Updated installation instructions and storage recommendations for Ollama.

Updated the document to reflect the correct name and description of the tool from 'Ollama' to 'vLLM'.

Added performance comparison of vLLM and llama-cpp on Torch, including throughput and latency metrics.

Updated median latency format for vLLM in the table.

Added section on SGLang for offline batch inference and linked to documentation.

Updated section title for clarity and added context about SGLang.

vLLM documentation

Amanda-dong added 19 commits January 6, 2026 20:43

Add documentation for Ollama command line tool

af4ea6b

Added comprehensive documentation for Ollama, a command line tool for running large language models, including installation instructions, environment variables, and usage examples.

Add installation instructions for Ollama

652a3c8

Document interactive Ollama sessions setup

d68bf3c

Added instructions for starting interactive Ollama sessions on a GPU node.

Update interactive Ollama session instructions

2928885

Added instructions for starting Ollama server and chatting with LLMs in interactive sessions.

Revise Ollama installation and storage guidance

e3ac668

Updated installation instructions and storage recommendations for Ollama.

Fix typo in Ollama installation instructions

787122d

Merge branch 'NYU-RTS:main' into main

ba4791c

Merge branch 'NYU-RTS:main' into main

edc80d4

Rename Ollama to vLLM in documentation

9cdd5bb

Updated the document to reflect the correct name and description of the tool from 'Ollama' to 'vLLM'.

Rename section from 'Ollama' to 'vLLM'

0ceb548

documentation half way through

c003c44

Update 07_ollama.md

3500e7f

Enhance vLLM documentation with performance metrics

5aa4d94

Added performance comparison of vLLM and llama-cpp on Torch, including throughput and latency metrics.

Fix median latency format for vLLM

58baf5f

Updated median latency format for vLLM in the table.

Rename 07_ollama.md to 07_vLLM.md

df3fc5c

Document SGLang for offline batch inference

2583f65

Added section on SGLang for offline batch inference and linked to documentation.

Merge branch 'NYU-RTS:main' into main

6f0508a

Clarify SGLang section title in vLLM documentation

c0d822d

Updated section title for clarity and added context about SGLang.

Merge pull request #1 from Amanda-dong/amanda/docs-update-1

4c932a2

vLLM documentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: add Ollama guide under 08_ml_ai_hpc#258

docs: add Ollama guide under 08_ml_ai_hpc#258
Amanda-dong wants to merge 19 commits intoNYU-RTS:mainfrom
Amanda-dong:main

Amanda-dong commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Amanda-dong commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant