Privacy Filter WebGPU
PII detection and text masking in your browser
zai-org/GLM-4.7-Flash
unsloth/GLM-4.7-Flash-GGUF
https://ai.azure.com/catalog/models/unsloth-glm-4.7-flash-gguf
hf-mem v0.4.1 now also estimates KV cache memory requirements for any context length and batch size with the --experimental flag! Running uvx hf-mem --model-id ... --experimental will automatically pull the required information from the Hugging Face Hub to include the KV cache estimation, when applicable. You can also set the --max-model-len, --batch-size and --kv-cache-dtype arguments (à la vLLM) manually if preferred.
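For a rough sense of where such an estimate comes from, here is a minimal sketch of the usual back-of-the-envelope KV cache formula for a decoder that uses standard multi-head or grouped-query attention in every layer. It is not hf-mem's implementation, and the tool may account for details this ignores (sliding-window layers, latent-attention variants, hybrid architectures). The function name, the dtype table, and the example model shape are all hypothetical and chosen only for illustration.

```python
# Illustrative only: a hypothetical helper, NOT part of hf-mem.
# Assumes every layer caches one key and one value vector per KV head per token.

# Bytes per element for a few common cache dtypes (assumed names).
DTYPE_BYTES = {"float32": 4, "float16": 2, "bfloat16": 2, "fp8": 1}


def kv_cache_bytes(
    num_layers: int,
    num_kv_heads: int,
    head_dim: int,
    max_model_len: int,
    batch_size: int = 1,
    kv_cache_dtype: str = "bfloat16",
) -> int:
    """Estimate KV cache size in bytes for full (non-windowed) attention."""
    # The factor 2 covers the key tensor and the value tensor.
    bytes_per_token = (
        2 * num_layers * num_kv_heads * head_dim * DTYPE_BYTES[kv_cache_dtype]
    )
    return bytes_per_token * max_model_len * batch_size


if __name__ == "__main__":
    # Made-up model shape, not taken from any real checkpoint.
    total = kv_cache_bytes(
        num_layers=32,
        num_kv_heads=8,
        head_dim=128,
        max_model_len=8192,
        batch_size=1,
    )
    print(f"{total / 2**30:.2f} GiB")  # prints "1.00 GiB"
```

The cache grows linearly with both context length and batch size, which is why those two values, together with the cache dtype, are the ones worth overriding by hand.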