GopherProxy

	Reduce threads to 5 to speed up generation and allow more parallel lamas. - ann…
	git clone git://bitreich.org/annna/ git://enlrupgkhuxnvlhsf6lc3fziv5h2hhfrinws6…
	Log
	Files
	Refs
	Tags
	README
	---
	commit ec75159ce63799ac70abd66ac590bc0d80c7dcab
	parent 27908724ac2dfc9736de111f4a6a1ac89e4c949f
	Author: Annna Robert-Houdin <[email protected]>
	Date: Sat, 4 Jan 2025 19:49:16 +0100

	Reduce threads to 5 to speed up generation and allow more parallel lamas.

	Diffstat:
	M gpt \| 2 +-

	1 file changed, 1 insertion(+), 1 deletion(-)
	---
	diff --git a/gpt b/gpt
	@@ -19,7 +19,7 @@ fi

	prompt="$1"
	printf "%s\n" "${prompt}" \
	- \| $ggmlbin -m $ggmlmodel -n $ggmlntokens \
	+ \| $ggmlbin -m $ggmlmodel -n $ggmlntokens -t 5 \
	--simple-io --no-display-prompt --grammar 'root ::= ([^\x00-\x…
	-p "${systemprompt}" -cnv 2>/dev/null \
	\| sed -E '/^$/d;s/^>[[:blank:]]+//;q'