Jesse Gross
7fe3902552
cli: Send all images in conversation history
...
Currently the CLI only sends images from the most recent image-containing
message. This prevents workflows such as sending one message with an image,
then a follow-up message with a second image, and asking for a comparison
based on information not present in any of the text output.
It's possible that some models have trouble with this, but the CLI is not
the right place to compensate, since any adjustments are model-specific
and should apply to all clients.
Both llava:34b and minicpm-v do reasonable things with multiple
images in the history.
2024-10-10 11:21:51 -07:00
Alex Mavrogiannis
f40bb398f6
Stop model before deletion if loaded ( fixed #6957 ) ( #7050 )
2024-10-01 15:45:43 -07:00
Patrick Devine
abed273de3
add "stop" command ( #6739 )
2024-09-11 16:36:21 -07:00
Michael Yang
ecab6f1cc5
refactor show output
...
fixes line wrapping on long texts
2024-09-11 14:23:09 -07:00
Daniel Hiltgen
6719097649
llm: make load time stall duration configurable via OLLAMA_LOAD_TIMEOUT
...
With the new very large parameter models, some users are willing to wait for
a very long time for models to load.
2024-09-05 14:00:08 -07:00
Daniel Hiltgen
b05c9e83d9
Introduce GPU Overhead env var ( #5922 )
...
Provide a mechanism for users to set aside an amount of VRAM on each GPU
to make room for other applications they want to start after Ollama, or to
work around memory-prediction bugs.
2024-09-05 13:46:35 -07:00
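Setting aside a fixed overhead per GPU amounts to subtracting the reservation from each GPU's reported free VRAM before scheduling. A hedged sketch of that arithmetic; `usableVRAM` is a hypothetical helper, not the scheduler's actual code:

```go
package main

import "fmt"

// usableVRAM subtracts a per-GPU overhead reservation from the free VRAM
// reported for each GPU, clamping at zero when the reservation exceeds
// what is free.
func usableVRAM(freeBytes []uint64, overhead uint64) []uint64 {
	out := make([]uint64, len(freeBytes))
	for i, free := range freeBytes {
		if free > overhead {
			out[i] = free - overhead
		}
	}
	return out
}

func main() {
	// Reserve 1 GiB on each of two GPUs (8 GiB and 512 MiB free).
	fmt.Println(usableVRAM([]uint64{8 << 30, 512 << 20}, 1<<30))
}
```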
Vimal Kumar
5f7b4a5e30
fix(cmd): show info may have nil ModelInfo ( #6579 )
2024-08-31 21:12:17 -07:00
Patrick Devine
0c819e167b
convert safetensor adapters into GGUF ( #6327 )
2024-08-23 11:29:56 -07:00
Michael Yang
beb49eef65
create bert models from cli
2024-08-20 17:27:34 -07:00
longtao
0a8d6ea86d
Fix typo and improve readability ( #5964 )
...
* Fix typo and improve readability
Summary:
* Rename updatAvailableMenuID to updateAvailableMenuID
* Replace unused cmd parameter with _ in RunServer function
* Fix typos in comments
(cherry picked from commit 5b8715f0b04773369e8eb1f9e6737995a0ab3ba7)
* Update api/client.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
2024-08-13 17:54:19 -07:00
Josh
f7e3b9190f
cmd: spinner progress for transfer model data ( #6100 )
2024-08-12 11:46:32 -07:00
Michael Yang
b732beba6a
lint
2024-08-01 17:06:06 -07:00
Michael Yang
c4c84b7a0d
Merge pull request #5196 from ollama/mxyng/messages-2
...
include modelfile messages
2024-07-31 10:18:17 -07:00
Michael Yang
5c1912769e
Merge pull request #5473 from ollama/mxyng/environ
...
fix: environ lookup
2024-07-31 10:18:05 -07:00
Daniel Hiltgen
1a83581a8e
Merge pull request #5895 from dhiltgen/sched_faq
...
Better explain multi-gpu behavior
2024-07-29 14:25:41 -07:00
Michael Yang
38d9036b59
Merge pull request #5992 from ollama/mxyng/save
...
fix: model save
2024-07-29 09:53:19 -07:00
Tibor Schmidt
f3d7a481b7
feat: add support for min_p ( resolve #1142 ) ( #1825 )
2024-07-27 14:37:40 -07:00
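min_p sampling is conventionally defined as discarding tokens whose probability falls below `min_p` times the most likely token's probability. A sketch of that filter under the standard definition; the real sampler operates on logits inside the runner, and `minP` here is illustrative:

```go
package main

import "fmt"

// minP returns the indices of tokens whose probability is at least
// p times the maximum probability in the distribution.
func minP(probs []float64, p float64) []int {
	maxProb := 0.0
	for _, pr := range probs {
		if pr > maxProb {
			maxProb = pr
		}
	}
	var keep []int
	for i, pr := range probs {
		if pr >= p*maxProb {
			keep = append(keep, i)
		}
	}
	return keep
}

func main() {
	// With p=0.1 the threshold is 0.05, so the 0.01 token is dropped.
	fmt.Println(minP([]float64{0.5, 0.3, 0.01, 0.19}, 0.1))
}
```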
Michael Yang
a250c2cb13
display messages
2024-07-26 13:39:57 -07:00
Michael Yang
3d9de805b7
fix: model save
...
stop parameter is saved as a slice which is incompatible with modelfile
parsing
2024-07-26 13:23:06 -07:00
Michael Yang
15af558423
include modelfile messages
2024-07-26 11:40:11 -07:00
Daniel Hiltgen
830fdd2715
Better explain multi-gpu behavior
2024-07-23 15:16:38 -07:00
Michael Yang
55cd3ddcca
bool
2024-07-22 11:27:21 -07:00
Michael Yang
4f1afd575d
host
2024-07-22 11:25:30 -07:00
Daniel Hiltgen
cc269ba094
Remove no longer supported max vram var
...
The OLLAMA_MAX_VRAM env var was a temporary workaround for OOM
scenarios. With concurrency support it was no longer wired up, and the
simplistic single value doesn't map to multi-GPU setups. Users can still set
`num_gpu` to limit memory usage and avoid OOM if we get our predictions wrong.
2024-07-22 09:08:11 -07:00
Patrick Devine
057d31861e
remove template ( #5655 )
2024-07-13 20:56:24 -07:00
Patrick Devine
23ebbaa46e
Revert "remove template from tests"
...
This reverts commit 9ac0a7a50b.
2024-07-12 15:47:17 -07:00
Patrick Devine
9ac0a7a50b
remove template from tests
2024-07-12 15:41:31 -07:00
royjhan
5f034f5b63
Include Show Info in Interactive ( #5342 )
2024-06-28 13:15:52 -07:00
royjhan
b910fa9010
Ollama Show: Check for Projector Type ( #5307 )
...
* Check exists projtype
* Maintain Ordering
2024-06-28 11:30:16 -07:00
Michael Yang
123a722a6f
zip: prevent extracting files into parent dirs ( #5314 )
2024-06-26 21:38:21 -07:00
Blake Mizerany
2aa91a937b
cmd: defer stating model info until necessary ( #5248 )
...
This commit changes the 'ollama run' command to defer fetching model
information until it actually needs it, that is, when in interactive mode.
It also removes one case where the model information was fetched in
duplicate: just before calling generateInteractive, and then again, first
thing, inside generateInteractive.
This positively impacts the performance of the command:
; time ./before run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./before run llama3 'hi' 0.02s user 0.01s system 2% cpu 1.168 total
; time ./before run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./before run llama3 'hi' 0.02s user 0.01s system 2% cpu 1.220 total
; time ./before run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./before run llama3 'hi' 0.02s user 0.01s system 2% cpu 1.217 total
; time ./after run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./after run llama3 'hi' 0.02s user 0.01s system 4% cpu 0.652 total
; time ./after run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./after run llama3 'hi' 0.01s user 0.01s system 5% cpu 0.498 total
; time ./after run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with or would you like to chat?
./after run llama3 'hi' 0.01s user 0.01s system 3% cpu 0.479 total
; time ./after run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./after run llama3 'hi' 0.02s user 0.01s system 5% cpu 0.507 total
; time ./after run llama3 'hi'
Hi! It's nice to meet you. Is there something I can help you with, or would you like to chat?
./after run llama3 'hi' 0.02s user 0.01s system 5% cpu 0.507 total
2024-06-24 20:14:03 -07:00
royjhan
fedf71635e
Extend api/show and ollama show to return more model info ( #4881 )
...
* API Show Extended
* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>
* Clean Up
* Descriptive arg error messages and other fixes
* Second Draft of Show with Projectors Included
* Remove Chat Template
* Touches
* Prevent wrapping from files
* Verbose functionality
* Docs
* Address Feedback
* Lint
* Resolve Conflicts
* Function Name
* Tests for api/show model info
* Show Test File
* Add Projector Test
* Clean routes
* Projector Check
* Move Show Test
* Touches
* Doc update
---------
Co-authored-by: Patrick Devine <pdevine@sonic.net>
2024-06-19 14:19:02 -07:00
Patrick Devine
c69bc19e46
move OLLAMA_HOST to envconfig ( #5009 )
2024-06-12 18:48:16 -04:00
Michael Yang
201d853fdf
nolintlint
2024-06-04 11:13:30 -07:00
Michael Yang
e40145a39d
lint
2024-06-04 11:13:30 -07:00
Michael Yang
8ffb51749f
nolintlint
2024-06-04 11:13:30 -07:00
Michael Yang
04f3c12bb7
replace x/exp/slices with slices
2024-06-04 11:13:30 -07:00
Josh Yan
914f68f021
replaced duplicate call with variable
2024-05-30 10:38:07 -07:00
Josh Yan
bd1d119ba9
fixed japanese characters deleted at end of line
2024-05-30 10:24:21 -07:00
Lei Jitang
a03be18189
Fix OLLAMA_LLM_LIBRARY with wrong map name and add more env vars to help message ( #4663 )
...
* envconfig/config.go: Fix wrong description of OLLAMA_LLM_LIBRARY
Signed-off-by: Lei Jitang <leijitang@outlook.com>
* serve: Add more env to help message of ollama serve
Add more environment variables to `ollama serve --help`
so users know what can be configured.
Signed-off-by: Lei Jitang <leijitang@outlook.com>
---------
Signed-off-by: Lei Jitang <leijitang@outlook.com>
2024-05-30 09:36:51 -07:00
Patrick Devine
4cc3be3035
Move envconfig and consolidate env vars ( #4608 )
2024-05-24 14:57:15 -07:00
Josh
9f18b88a06
Merge pull request #4566 from ollama/jyan/shortcuts
...
add Ctrl + W shortcut
2024-05-21 22:49:36 -07:00
Josh Yan
353f83a9c7
add Ctrl + W shortcut
2024-05-21 16:55:09 -07:00
Patrick Devine
d355d2020f
add fixes for llama
2024-05-20 16:13:57 -07:00
Patrick Devine
ccdf0b2a44
Move the parser back + handle utf16 files ( #4533 )
2024-05-20 11:26:45 -07:00
Patrick Devine
105186aa17
add OLLAMA_NOHISTORY to turn off history in interactive mode ( #4508 )
2024-05-18 11:51:57 -07:00
Josh Yan
3d90156e99
removed comment
2024-05-16 14:12:03 -07:00
Josh Yan
26bfc1c443
go fmt'd cmd.go
2024-05-15 17:26:39 -07:00
Josh Yan
799aa9883c
go fmt'd cmd.go
2024-05-15 17:24:17 -07:00
Josh Yan
c9e584fb90
updated double-width display
2024-05-15 16:45:24 -07:00
Josh Yan
17b1e81ca1
fixed width and word count for double spacing
2024-05-15 16:29:33 -07:00
Patrick Devine
c344da4c5a
fix keepalive for non-interactive mode ( #4438 )
2024-05-14 15:17:04 -07:00
Patrick Devine
a4b8d1f89a
re-add system context ( #4435 )
2024-05-14 11:38:20 -07:00
Patrick Devine
7ca71a6b0f
don't abort when an invalid model name is used in /save ( #4416 )
2024-05-13 18:48:28 -07:00
Patrick Devine
6845988807
Ollama ps command for showing currently loaded models ( #4327 )
2024-05-13 17:17:36 -07:00
Josh Yan
f8464785a6
removed inconsistencies
2024-05-13 14:50:52 -07:00
Josh Yan
91a090a485
removed inconsistent punctuation
2024-05-13 14:08:22 -07:00
todashuta
8080fbce35
fix ollama create's usage string ( #4362 )
2024-05-11 14:47:49 -07:00
Jeffrey Morgan
6602e793c0
Use --quantize flag and quantize api parameter ( #4321 )
...
* rename `--quantization` to `--quantize`
* backwards
* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>
---------
Co-authored-by: Michael Yang <mxyng@pm.me>
2024-05-10 13:06:13 -07:00
Tobias Gårdhus
06ac829e70
Fix help string for stop parameter ( #2307 )
2024-05-07 16:48:35 -07:00
Jeffrey Morgan
39d9d22ca3
close server on receiving signal ( #4213 )
2024-05-06 16:01:37 -07:00
Michael Yang
b7a87a22b6
Merge pull request #4059 from ollama/mxyng/parser-2
...
rename parser to model/file
2024-05-03 13:01:22 -07:00
Michael Yang
e9ae607ece
Merge pull request #3892 from ollama/mxyng/parser
...
refactor modelfile parser
2024-05-02 17:04:47 -07:00
Bryce Reitano
bf4fc25f7b
Add a /clear command ( #3947 )
...
* Add a /clear command
* change help messages
---------
Co-authored-by: Patrick Devine <patrick@infrahq.com>
2024-05-01 17:44:36 -04:00
Michael Yang
45b6a12e45
server: target invalid
2024-05-01 12:40:45 -07:00
Michael Yang
119589fcb3
rename parser to model/file
2024-05-01 09:53:50 -07:00
Michael Yang
5ea844964e
cmd: import regexp
2024-05-01 09:53:45 -07:00
Michael Yang
176ad3aa6e
parser: add commands format
2024-05-01 09:52:54 -07:00
Bruce MacDonald
0a7fdbe533
prompt to display and add local ollama keys to account ( #3717 )
...
- return descriptive error messages when unauthorized to create blob or push a model
- display the local public key associated with the request that was denied
2024-04-30 11:02:08 -07:00
Patrick Devine
9009bedf13
better checking for OLLAMA_HOST variable ( #3661 )
2024-04-29 19:14:07 -04:00
Michael Yang
41e03ede95
check file type before zip
2024-04-26 14:18:07 -07:00
Michael Yang
ac0801eced
only replace if it matches command
2024-04-24 14:49:26 -07:00
Michael Yang
ad66e5b060
split temp zip files
2024-04-24 14:18:01 -07:00
Bruce MacDonald
658e60cf73
Revert "stop running model on interactive exit"
...
This reverts commit fad00a85e5.
2024-04-22 17:23:11 -07:00
Bruce MacDonald
fad00a85e5
stop running model on interactive exit
2024-04-22 16:22:14 -07:00
Blake Mizerany
949d7832cf
Revert "cmd: provide feedback if OLLAMA_MODELS is set on non-serve command ( #3470 )" ( #3662 )
...
This reverts commit 7d05a6ee8f .
This proved to be more painful than useful.
See: https://github.com/ollama/ollama/issues/3624
2024-04-15 16:58:00 -07:00
Patrick Devine
9f8691c6c8
Add llama2 / torch models for ollama create ( #3607 )
2024-04-15 11:26:42 -07:00
Michael Yang
9502e5661f
cgo quantize
2024-04-08 15:31:08 -07:00
Blake Mizerany
7d05a6ee8f
cmd: provide feedback if OLLAMA_MODELS is set on non-serve command ( #3470 )
...
This also moves the checkServerHeartbeat call out of the "RunE" Cobra
hook to the call site, after the check for OLLAMA_MODELS, which allows
the helpful error message to be printed before the server heartbeat check.
This also arguably makes the code more readable without the
magic/superfluous "pre" function caller.
2024-04-02 22:11:13 -07:00
Pier Francesco Contino
531324a9be
feat: add OLLAMA_DEBUG in ollama server help message ( #3461 )
...
Co-authored-by: Pier Francesco Contino <pfcontino@gmail.com>
2024-04-02 18:20:03 -07:00
Patrick Devine
5a5efee46b
Add gemma safetensors conversion ( #3250 )
...
Co-authored-by: Michael Yang <mxyng@pm.me>
2024-03-28 18:54:01 -07:00
Patrick Devine
1b272d5bcd
change github.com/jmorganca/ollama to github.com/ollama/ollama ( #3347 )
2024-03-26 13:04:17 -07:00
Daniel Hiltgen
da20786e3e
Merge pull request #3068 from dhiltgen/win_pipe
...
Use stdin for term discovery on windows
2024-03-14 11:55:19 -07:00
Jeffrey Morgan
672ffe9b7d
add OLLAMA_KEEP_ALIVE to environment variable docs for ollama serve ( #3127 )
2024-03-13 14:35:33 -07:00
Daniel Hiltgen
c1a81c6fe3
Use stdin for term discovery on windows
...
When you feed input to the cmd via a pipe it no longer reports a warning
2024-03-13 10:37:31 -07:00
Blake Mizerany
2ada81e068
cmd: tighten up env var usage sections ( #2962 )
...
Also, document OLLAMA_HOST client semantics per command that honors it.
This looks nicer than having a general-purpose environment-variable
section in the root usage, which was showing up after the "Additional help
topics" section output by Cobra's default template.
It was decided this was easier to work with than using a custom template
for Cobra right now.
2024-03-07 13:57:07 -08:00
Patrick Devine
2c017ca441
Convert Safetensors to an Ollama model ( #2824 )
2024-03-06 21:01:51 -08:00
Blake Mizerany
0ded7fdc4b
cmd: document environment variables for serve command
...
Updates #2944
2024-03-06 13:48:46 -08:00
Michael Yang
fd10a2ad4b
remove format/openssh.go
...
this is unnecessary now that x/crypto/ssh.MarshalPrivateKey has been
added
2024-02-23 16:52:23 -08:00
lulz
ce0c95d097
[fix] /bye and /exit are now treated as prefixes ( #2381 )
...
* [fix] /bye and /exit are now treated as prefixes
instead of as entire lines, which didn't align with the way the rest of the commands are treated
* Update cmd/interactive.go
Fixing whitespace
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
2024-02-19 21:56:49 -05:00
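Treating `/bye` and `/exit` like the other slash commands means matching the first word of the line rather than requiring the whole line to be the command. A sketch of that dispatch, assuming first-word matching; `isExitCommand` is an illustrative name, not the actual interactive.go code:

```go
package main

import (
	"fmt"
	"strings"
)

// isExitCommand reports whether a REPL line begins with /bye or /exit,
// so trailing text after the command no longer prevents a match.
func isExitCommand(line string) bool {
	fields := strings.Fields(line)
	if len(fields) == 0 {
		return false
	}
	return fields[0] == "/bye" || fields[0] == "/exit"
}

func main() {
	fmt.Println(isExitCommand("/bye now"))
}
```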
Bruce MacDonald
88622847c6
fix: chat system prompting overrides ( #2542 )
2024-02-16 14:42:43 -05:00
Daniel Hiltgen
a468ae0459
Merge pull request #2499 from ollama/windows-preview
...
Windows Preview
2024-02-15 16:06:32 -08:00
Daniel Hiltgen
4a10e7a7fa
Harden the OLLAMA_HOST lookup for quotes
2024-02-15 13:46:56 -08:00
Daniel Hiltgen
823a520266
Fix lint error on ignored error for win console
2024-02-15 05:56:45 +00:00
vinjn
66ef308abd
Import "containerd/console" lib to support colorful output in Windows terminal
2024-02-15 05:56:45 +00:00
Daniel Hiltgen
29e90cc13b
Implement new Go based Desktop app
...
This focuses on Windows first, but could be used for Mac
and possibly Linux in the future.
2024-02-15 05:56:45 +00:00
Jeffrey Morgan
1f9078d6ae
Check image filetype in api handlers ( #2467 )
2024-02-12 11:16:20 -08:00
Jeffrey Morgan
09a6f76f4c
fix error on ollama run with a non-existent model
2024-02-01 23:11:52 -08:00
Jeffrey Morgan
e135167484
Add multimodal support to ollama run in noninteractive mode ( #2317 )
2024-02-01 21:33:06 -08:00
Jeffrey Morgan
38296ab352
clear previous images when submitting an image to ollama run ( #2316 )
2024-02-01 21:30:26 -08:00
Jeffrey Morgan
7913104527
Improvements to ollama run for multimodal models ( #2300 )
2024-02-01 17:09:51 -08:00
Patrick Devine
7c40a67841
Save and load sessions ( #2063 )
2024-01-25 12:12:36 -08:00
Michael Yang
b6c0ef1e70
Merge pull request #1961 from jmorganca/mxyng/rm-double-newline
...
remove double newlines in /set parameter
2024-01-12 15:18:19 -08:00
Patrick Devine
565f8a3c44
Convert the REPL to use /api/chat for interactive responses ( #1936 )
2024-01-12 12:05:52 -08:00
Michael Yang
5121b7ac9c
remove double newlines in /set parameter
2024-01-12 11:21:15 -08:00
Michael Yang
2bb2bdd5d4
fix lint
2024-01-09 09:36:58 -08:00
Michael Yang
62023177f6
Merge pull request #1614 from jmorganca/mxyng/fix-set-template
...
fix: set template without triple quotes
2024-01-09 09:36:24 -08:00
Bruce MacDonald
7e8f7c8358
remove ggml automatic re-pull ( #1856 )
2024-01-08 14:41:01 -05:00
Daniel Hiltgen
e0d05b0f1e
Accept windows paths for image processing
...
This enhances our regex to support windows style paths. The regex will
match invalid path specifications, but we'll still validate file
existence and filter out mismatches
2024-01-06 10:50:27 -08:00
Michael Yang
5580ae2472
fix: set template without triple quotes
2024-01-05 15:51:33 -08:00
Bruce MacDonald
3a9f447141
only pull gguf model if already exists ( #1817 )
2024-01-05 18:50:00 -05:00
Patrick Devine
9c2941e61b
switch api for ShowRequest to use the name field ( #1816 )
2024-01-05 15:06:43 -08:00
Bruce MacDonald
4f4980b66b
simplify ggml update logic ( #1814 )
...
- additional information is now available in show response, use this to pull gguf before running
- make gguf updates cancellable
2024-01-05 15:22:32 -05:00
Patrick Devine
22e93efa41
add show info command and fix the modelfile
2024-01-05 12:20:05 -08:00
Patrick Devine
2909dce894
split up interactive generation
2024-01-05 12:20:05 -08:00
Patrick Devine
d0409f772f
keyboard shortcut help ( #1764 )
2024-01-02 18:04:12 -08:00
Daniel Hiltgen
96fb441abd
Merge pull request #1146 from dhiltgen/ext_server_cgo
...
Add cgo implementation for llama.cpp
2023-12-22 08:16:31 -08:00
Bruce MacDonald
fabf2f3467
allow for starting llava queries with filepath ( #1549 )
2023-12-21 13:20:59 -05:00
Bruce MacDonald
811b1f03c8
deprecate ggml
...
- remove ggml runner
- automatically pull gguf models when ggml detected
- tell users to update to gguf in case the automatic pull fails
Co-Authored-By: Jeffrey Morgan <jmorganca@gmail.com>
2023-12-19 09:05:46 -08:00
Bruce MacDonald
1b417a7836
use exp slices for go 1.20 compatibility ( #1544 )
2023-12-15 14:15:56 -05:00
Patrick Devine
630518f0d9
Add unit test of API routes ( #1528 )
2023-12-14 16:47:40 -08:00
Jeffrey Morgan
4a1abfe4fa
fix tests
2023-12-13 14:42:30 -05:00
Jeffrey Morgan
0a9d348023
Fix issues with /set template and /set system ( #1486 )
2023-12-12 14:43:19 -05:00
Patrick Devine
910e9401d0
Multimodal support ( #1216 )
...
---------
Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local>
2023-12-11 13:56:22 -08:00
Michael Yang
16c7548460
fix redundant newline
2023-12-07 13:44:45 -08:00
Michael Yang
4b77fcb2b9
comments
2023-12-05 09:43:50 -08:00
Michael Yang
cde13bcdea
cmd: only print server version when different
2023-12-05 09:36:01 -08:00
Michael Yang
0f0cd265a7
cmd: add server version
2023-12-05 09:36:01 -08:00
Michael Yang
5c59455b59
cmd: use existing cmd context
2023-12-05 09:36:01 -08:00
Patrick Devine
bf704423c5
revert cli to use /api/generate ( #1383 )
2023-12-04 16:35:29 -08:00
Bruce MacDonald
7a0899d62d
chat api ( #991 )
...
- update chat docs
- add messages chat endpoint
- remove deprecated context and template generate parameters from docs
- context and template are still supported for the time being and will continue to work as expected
- add partial response to chat history
2023-12-04 18:01:06 -05:00
Patrick Devine
2113c9d31a
make linewrap still work when the terminal width has changed ( #1350 )
2023-12-04 14:14:56 -08:00
Patrick Devine
6681d37861
allow setting the system and template for prompts in the repl ( #1335 )
2023-12-01 09:28:35 -08:00
Jeffrey Morgan
5687f1a0cf
fix unexpected end of response errors when cancelling in ollama run
2023-11-30 00:30:21 -05:00
Patrick Devine
cde31cb220
Allow setting parameters in the REPL ( #1294 )
2023-11-29 09:56:42 -08:00
Jeffrey Morgan
9fb5e8399c
Fix issues with inputting and formatting multi line strings in ollama run
...
Co-authored-by: Wen Sun <iwendellsun@gmail.com>
2023-11-26 12:54:29 -05:00
Jeffrey Morgan
df07e4a097
remove redundant filename parameter ( #1213 )
2023-11-20 17:05:36 -05:00
Bruce MacDonald
31ab453d37
resolve FROM path before sending modelfile ( #1211 )
2023-11-20 16:43:48 -05:00
Jeffrey Morgan
6066c70edd
restore progress messages for older endpoints
2023-11-20 11:37:17 -05:00
Jeffrey Morgan
6bbd6e26fb
fix temporary newline created and removed with spinner in ollama run
2023-11-20 00:49:08 -05:00
Jeffrey Morgan
c06b9b7304
update progress rendering to be closer to v0.1.10
2023-11-19 13:43:21 -05:00
Jeffrey Morgan
984714f131
update status text when transferring blob on ollama create
2023-11-18 09:40:10 -05:00
Michael Yang
976068369b
stop all spinners on progress stop
2023-11-17 10:06:19 -08:00
Michael Yang
4dcf7a59b1
generate progress
2023-11-17 10:06:19 -08:00
Michael Yang
1c0e092ead
progress cmd
2023-11-17 10:06:19 -08:00
Michael Yang
f91bb2f7f0
remove progressbar
2023-11-17 10:06:19 -08:00
Michael Yang
1901044b07
use checksum reference
2023-11-15 15:16:23 -08:00
Michael Yang
d660eebf22
fix create from model tag
2023-11-15 15:16:23 -08:00
Michael Yang
1552cee59f
client create modelfile
2023-11-15 15:16:23 -08:00
Michael Yang
01ea6002c4
replace go-humanize with format.HumanBytes
2023-11-14 14:57:41 -08:00
Jeffrey Morgan
423862042a
treat ollama run model < file as entire prompt, not prompt-per-line ( #1126 )
...
Previously, `ollama run` treated a non-terminal stdin (such as `ollama run model < file`) as containing one prompt per line. To run inference on a multi-line prompt, the only non-API workaround was to run `ollama run` interactively and wrap the prompt in `"""..."""`.
Now, `ollama run` treats a non-terminal stdin as containing a single prompt. For example, if `myprompt.txt` is a multi-line file, then `ollama run model < myprompt.txt` would treat `myprompt.txt`'s entire contents as the prompt.
Co-authored-by: Quinn Slack <quinn@slack.org>
2023-11-14 16:42:21 -05:00
Jeffrey Morgan
4e612a2e92
use stdout fd for terminal size ( #1125 )
2023-11-14 16:09:09 -05:00
Jeffrey Morgan
6e0f686afa
--format json should work in interactive mode
2023-11-14 10:22:03 -05:00
Jeffrey Morgan
c1844bbee2
add json mode to cli ( #1095 )
2023-11-13 21:54:02 -05:00
Michael Yang
bf6786bb39
fix tautology
2023-10-31 20:49:48 -07:00
Bruce MacDonald
f9a4281124
clean up: remove server functions from client ( #937 )
2023-10-30 11:10:18 -04:00
Jeffrey Morgan
9ec16f0f03
fix formatting when exiting ollama run
2023-10-27 21:26:23 -07:00
Jeffrey Morgan
2d75a4537c
close input channel when receiving io.EOF
2023-10-27 20:26:04 -07:00
Patrick Devine
a79f030e75
add bracketed paste mode ( #922 )
2023-10-26 15:57:00 -07:00
Patrick Devine
deeac961bb
new readline library ( #847 )
2023-10-25 16:41:18 -07:00
Michael Yang
36c88cb9db
cmd: set ExactArgs
2023-10-18 14:40:48 -07:00
Bruce MacDonald
68d7255bd3
show request to server rather than local check ( #778 )
2023-10-16 17:27:25 -04:00
Bruce MacDonald
a0c3e989de
deprecate modelfile embed command ( #759 )
2023-10-16 11:07:37 -04:00
Bruce MacDonald
56497663c8
relay model runner error message to client ( #720 )
...
* give direction to user when runner fails
* also relay errors from timeout
* increase timeout to 3 minutes
2023-10-12 11:16:37 -04:00
Michael Yang
2cfffea02e
handle client proxy
2023-10-09 12:33:47 -07:00
Patrick Devine
61ff1946e6
revise help text ( #706 )
2023-10-05 11:36:07 -07:00
Alexander F. Rødseth
d104b7e997
Fix go test ./... issue: fmt.Println arg list ends with redundant newline ( #705 )
2023-10-05 11:11:04 -04:00
Patrick Devine
1852755154
show a default message when license/parameters/system prompt/template aren't specified ( #681 )
2023-10-02 14:34:52 -07:00
Patrick Devine
99d5161e8a
don't wordwrap when stdout is redirected or piped ( #662 )
2023-10-02 11:50:55 -07:00
Michael Yang
9333b0cc82
Merge pull request #612 from jmorganca/mxyng/prune-empty-directories
...
prune empty directories
2023-09-29 11:23:39 -07:00
Patrick Devine
76db4a49cf
allow the user to cancel generating with ctrl-C ( #641 )
2023-09-28 17:13:01 -07:00
Luc Stepniewski
4aa0976a2e
Added missing return preventing SIGSEGV because of missing resp ( #621 )
...
Co-authored-by: Luc Stepniewski <luc@eclipse-fr.com>
2023-09-28 14:25:22 -07:00
Patrick Devine
92c20fdae6
fix error messages for unknown commands in the repl ( #611 )
2023-09-28 14:19:45 -07:00
Michael Yang
f40b3de758
use int64 consistently
2023-09-28 11:07:24 -07:00
Michael Yang
8608eb4760
prune empty directories
2023-09-27 10:58:09 -07:00
Michael Yang
0625e805f0
fix model name not matching
2023-09-26 19:50:04 -07:00
Michael Yang
93d887e4bc
add painter message for exit
2023-09-25 16:30:22 -07:00
Patrick Devine
b5614f3ebc
fix end-of-line issue with the new prompt ( #582 )
2023-09-23 17:20:30 -07:00
Jeffrey Morgan
01c44d687e
add multi line strings to final prompt
2023-09-23 00:27:24 -04:00
Jeffrey Morgan
e20362e0d5
fix multi line input in ollama run
2023-09-22 23:49:35 -04:00
Patrick Devine
c928ceb927
add word wrapping for lines which are longer than the terminal width ( #553 )
2023-09-22 13:36:08 -07:00
Patrick Devine
87d9efb364
switch to forked readline lib which doesn't wreck the repl prompt ( #578 )
2023-09-22 12:17:45 -07:00
Michael Yang
88897a90e4
fix ipv6 parse ip
2023-09-22 10:41:32 -07:00
Michael Yang
6137b12799
validate existence and pull model using api
2023-09-21 09:55:34 -07:00
Michael Yang
9297ff8330
fix OLLAMA_HOST parsing for ip6
2023-09-20 18:52:57 -07:00
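The subtlety with IPv6 in OLLAMA_HOST is that a bare literal like `::1` contains colons, so naive host:port splitting breaks; bracketed forms like `[::1]:11434` are what `net.SplitHostPort` expects. A hedged sketch of such parsing; `splitHostPort` here is a hypothetical helper mirroring the fix, not the actual code:

```go
package main

import (
	"fmt"
	"net"
	"strings"
)

// splitHostPort splits an OLLAMA_HOST-style value into host and port,
// handling bracketed IPv6 literals and falling back to defaults when
// no port (or no value) is given.
func splitHostPort(s, defHost, defPort string) (string, string) {
	if s == "" {
		return defHost, defPort
	}
	host, port, err := net.SplitHostPort(s)
	if err != nil {
		// No port present: e.g. "127.0.0.1", "::1", or "[::1]".
		return strings.Trim(s, "[]"), defPort
	}
	return host, port
}

func main() {
	h, p := splitHostPort("[::1]:11434", "127.0.0.1", "11434")
	fmt.Println(h, p)
}
```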
Michael Yang
58ffa03d8b
fix impossible condition
2023-09-20 11:27:44 -07:00
Michael Yang
a5520bfb42
fix build
2023-09-19 10:42:24 -07:00
Michael Yang
b58d5d16b0
fix mkdir on windows
2023-09-19 09:41:13 -07:00
Patrick Devine
80dd44e80a
Cmd changes ( #541 )
2023-09-18 12:26:56 -07:00
Patrick Devine
e7e91cd71c
add autoprune to remove unused layers ( #491 )
2023-09-11 11:46:35 -07:00
Patrick Devine
1adfa67589
tighten up the error string for ollama show flags ( #476 )
2023-09-06 13:38:49 -07:00
Patrick Devine
790d24eb7b
add show command ( #474 )
2023-09-06 11:04:17 -07:00
Patrick Devine
8bbff2df98
add model IDs ( #439 )
2023-08-28 20:50:24 -07:00
Quinn Slack
2ecc3a33c3
delete all models (not just 1st) in ollama rm ( #415 )
...
Previously, `ollama rm model1 model2 modelN` would only delete `model1`. The other model command-line arguments would be silently ignored. Now, all models mentioned are deleted.
2023-08-26 00:47:56 -07:00
Michael Yang
9ec7e37534
Merge pull request #392 from jmorganca/mxyng/version
...
add version
2023-08-22 09:50:25 -07:00
Michael Yang
2c7f956b38
add version
2023-08-22 09:40:58 -07:00
Jeffrey Morgan
a9f6c56652
fix FROM instruction erroring when referring to a file
2023-08-22 09:39:42 -07:00
Ryan Baker
0a892419ad
Strip protocol from model path ( #377 )
2023-08-21 21:56:56 -07:00
Michael Yang
0ebec07569
Merge pull request #345 from jmorganca/exit-non-zero
...
set non-zero error code on error
2023-08-16 09:20:28 -07:00
Blake Mizerany
67e593e355
cmd: support OLLAMA_CLIENT_HOST environment variable ( #262 )
...
* cmd: support OLLAMA_HOST environment variable
This commit adds support for the OLLAMA_HOST environment
variable. This variable can be used to specify the host to which
the client should connect. This is useful when the client is
running somewhere other than the host where the server is running.
The new api.FromEnv function is used to configure clients from the
environment. Clients wishing to be consistent with the Ollama CLI's use
of the environment variable can use this new function.
* Update api/client.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
* Update api/client.go
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
2023-08-16 11:03:48 -04:00
Michael Yang
76b85bc0e9
set non-zero error code on error
2023-08-14 14:09:58 -07:00
Jeffrey Morgan
1556162c90
create .ollama directory if it doesn't exist
2023-08-11 15:35:55 -07:00
Patrick Devine
9770e3b325
Generate private/public keypair for use w/ auth ( #324 )
2023-08-11 10:58:23 -07:00
Jeffrey Morgan
7e26a8df31
cmd: use environment variables for server options
2023-08-10 14:17:53 -07:00
Soroush Javadi
bea683e3bf
cmd: check GetBlobsPath error ( #317 )
...
The error returned by `server.GetBlobsPath` in `showLayer` was never
checked. Check the error and return if not nil. Also, make newlines at
the end of error messages consistent and fix a typo.
2023-08-10 09:57:49 -07:00
Jeffrey Morgan
f65169b13e
clean up cli flags
2023-08-10 09:28:56 -07:00
Jeffrey Morgan
040a5b9750
clean up cli flags
2023-08-10 09:27:03 -07:00
Bruce MacDonald
a6f6d18f83
embed text document in modelfile
2023-08-08 11:27:17 -04:00
cmiller01
93492f1e18
correct precedence of serve params (args over env over default)
2023-08-07 19:55:20 +00:00
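The corrected precedence can be sketched as a simple three-way resolution: an explicit flag beats the environment variable, which beats the built-in default. Illustrative only; `resolve` is not the actual serve code:

```go
package main

import "fmt"

// resolve picks a serve parameter using the fixed precedence:
// command-line flag over environment variable over default.
func resolve(flagVal, envVal, def string) string {
	if flagVal != "" {
		return flagVal
	}
	if envVal != "" {
		return envVal
	}
	return def
}

func main() {
	// Env var set, no flag given: the env value wins over the default.
	fmt.Println(resolve("", "0.0.0.0", "127.0.0.1"))
}
```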
cmiller01
fb593b7bfc
pass flags to serve to allow setting allowed-origins + host and port
...
* resolves: https://github.com/jmorganca/ollama/issues/300 and
https://github.com/jmorganca/ollama/issues/282
* example usage:
```
ollama serve --port 9999 --allowed-origins "http://foo.example.com,http://192.0.0.1"
```
2023-08-07 03:34:37 +00:00
Bruce MacDonald
bd6d741d87
tell users to check the server error logs
2023-08-02 17:08:11 -04:00
Bruce MacDonald
8f8b6288ac
check server is running before running command
2023-08-02 10:51:23 -04:00
Bruce MacDonald
50e87c6691
read from os executable
2023-08-01 16:01:55 -04:00
Bruce MacDonald
40a25bf8c3
pr comments
2023-08-01 13:48:48 -04:00
Jeffrey Morgan
528bafa585
cache loaded model
2023-08-01 11:24:18 -04:00
Bruce MacDonald
36d6081ed1
find symlink of mac app
2023-07-31 17:38:10 -04:00
Bruce MacDonald
e72fe7945f
check server is running before running command
2023-07-31 16:25:57 -04:00
Bruce MacDonald
d1c098b038
tell users to check the server error logs
2023-07-31 11:49:33 -04:00
Patrick Devine
39bb25d5f6
allow multiline text using three double-quotes ( #239 )
2023-07-29 13:35:23 -07:00
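The triple-quote feature amounts to buffering REPL lines between a `"""` opener and a `"""` closer and emitting them as one prompt. A simplified sketch of that accumulation; `collectMultiline` is an illustrative name and ignores edge cases the real REPL may handle:

```go
package main

import (
	"fmt"
	"strings"
)

// collectMultiline joins input lines, treating text between a `"""`
// opener and a `"""` closer as a single multiline prompt; other lines
// pass through unchanged.
func collectMultiline(lines []string) []string {
	var prompts, buf []string
	open := false
	for _, l := range lines {
		switch {
		case !open && strings.HasPrefix(l, `"""`):
			open = true
			buf = buf[:0]
			if rest := strings.TrimPrefix(l, `"""`); rest != "" {
				buf = append(buf, rest)
			}
		case open && strings.HasSuffix(l, `"""`):
			open = false
			if rest := strings.TrimSuffix(l, `"""`); rest != "" {
				buf = append(buf, rest)
			}
			prompts = append(prompts, strings.Join(buf, "\n"))
		case open:
			buf = append(buf, l)
		default:
			prompts = append(prompts, l)
		}
	}
	return prompts
}

func main() {
	fmt.Println(collectMultiline([]string{`"""first`, "second", `third"""`}))
}
```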
Patrick Devine
01d155c969
show system/template/license layers from cmd prompt ( #223 )
2023-07-27 16:58:40 -07:00
Michael Yang
35af37a2cb
session id
2023-07-27 09:31:44 -07:00
Bruce MacDonald
4c1caa3733
download models when creating from modelfile
2023-07-25 14:25:13 -04:00
Bruce MacDonald
12ab8f8f5f
Revert "pull model on make if not present locally"
...
This reverts commit 360a10ace391a674de60aa7b9b8cb65e8074027c.
2023-07-25 14:18:46 -04:00
Bruce MacDonald
8ebbd12f21
pull model on make if not present locally
2023-07-25 14:18:46 -04:00
Patrick Devine
4cb42ca55e
add copy command ( #191 )
2023-07-24 11:27:28 -04:00
Patrick Devine
88c55199f8
change push to chunked uploads from monolithic ( #179 )
2023-07-22 17:31:26 -07:00
Patrick Devine
6d6b0d3321
change error handler behavior and fix error when a model isn't found ( #173 )
2023-07-21 23:02:12 -07:00
Patrick Devine
9f6e97865c
allow pushing/pulling to insecure registries ( #157 )
2023-07-21 15:42:19 -07:00
Patrick Devine
e7a393de54
add rm command for models ( #151 )
2023-07-20 16:09:23 -07:00
Patrick Devine
a3badaf103
add ls alias ( #152 )
2023-07-20 15:28:27 -07:00
Michael Yang
55b5f5dc34
ctrl+c on empty line exits ( #135 )
2023-07-20 00:53:08 -07:00
Patrick Devine
e4d7f3e287
vendor in progress bar and change to bytes instead of bibytes ( #130 )
2023-07-19 17:24:03 -07:00
Michael Yang
dbc90576b8
add verbose/quiet commands
2023-07-19 13:34:56 -07:00
Michael Yang
84200dcde6
use readline
2023-07-19 13:34:56 -07:00
Michael Yang
a6d03dd510
Merge pull request #110 from jmorganca/fix-pull-0-bytes
...
fix pull 0 bytes on completed layer
2023-07-18 19:38:59 -07:00
Michael Yang
68df36ae50
fix pull 0 bytes on completed layer
2023-07-18 19:38:11 -07:00
Michael Yang
d4cfee79d5
resolve modelfile before passing to server
2023-07-18 19:34:05 -07:00
Michael Yang
885f67a471
Merge pull request #92 from jmorganca/create-model-spinner
...
Create model spinner
2023-07-18 11:15:45 -07:00
Patrick Devine
5bea29f610
add new list command ( #97 )
2023-07-18 09:09:45 -07:00
Patrick Devine
4a28a2f093
add modelpaths ( #96 )
2023-07-17 22:44:21 -07:00
Michael Yang
e4300e1eb7
add spinner to create
2023-07-17 14:15:42 -07:00
Michael Yang
aba706ea2d
remove unused persistent pre run
2023-07-17 14:14:57 -07:00
Patrick Devine
2fb52261ad
basic distribution w/ push/pull ( #78 )
...
* basic distribution w/ push/pull
* add the parser
* add create, pull, and push
* changes to the parser, FROM line, and fix commands
* mkdirp new manifest directories
* make `blobs` directory if it does not exist
* fix go warnings
* add progressbar for model pulls
* move model struct
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
2023-07-16 17:02:22 -07:00
Michael Yang
743e957d88
use filepath for os compat
2023-07-14 17:27:14 -07:00
Michael Yang
1775647f76
continue conversation
...
feed responses back into the llm
2023-07-13 17:13:00 -07:00
Michael Yang
05e08d2310
return more info in generate response
2023-07-13 09:37:32 -07:00
Michael Yang
0944b01e7d
pull fixes
2023-07-12 09:55:07 -07:00
Michael Yang
e243329e2e
check api status
2023-07-11 13:42:05 -07:00
Michael Yang
edba935d67
return error in generate response
2023-07-10 13:30:10 -07:00
Bruce MacDonald
07a4c1e3fb
take all args as one prompt
2023-07-10 06:05:09 -04:00