Update ollama_dl.py - for larger files and continuous download #1
base: master
Conversation
Thank you! A couple comments within, and could you ensure the code is properly formatted with ruff? It looks like there are some spurious changes that remove newlines and make it non-compliant with the Ruff code style. I just added a pre-commit workflow to master if that helps.
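(For reference, running `ruff format .` followed by `ruff check --fix .` from the repository root should bring the file back in line with the project style, assuming ruff is installed locally.)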
```python
headers = {}
mode = "wb"

if temp_path.exists():
```
Since the temporary path is a new one every time we enter this function (it includes the current time, fractional seconds and all), there's practically zero chance that this will ever be true. 🤔
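A minimal sketch of how the check could become meaningful: derive the temporary path from the blob's digest rather than from the current time, so a re-run lands on the same partial file (`dest_path` and `digest` are assumed names here, not necessarily the PR's variables):

```python
from pathlib import Path

def temp_path_for(dest_path: Path, digest: str) -> Path:
    # Deterministic name: the same blob always maps to the same partial
    # file, so a restarted download can actually find and resume it.
    return dest_path.with_name(f"{dest_path.name}.{digest[:12]}.partial")

def resume_state(temp_path: Path) -> tuple[dict, str]:
    headers: dict = {}
    mode = "wb"
    if temp_path.exists():
        # Pick up where the previous attempt stopped.
        headers["Range"] = f"bytes={temp_path.stat().st_size}-"
        mode = "ab"
    return headers, mode
```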
```python
else:
    temp_path.unlink()
```
I'm not sure why we'd want to remove the tempfile and start anew if it already looks complete?
```python
log.debug(f"Content-Length: {content_length}, Expected total size: {total_size}")

with temp_path.open(mode) as f:
    async for chunk in resp.aiter_bytes(8192):
```
8192 bytes is tiny for multi-gigabyte files. The large buffer size was chosen on purpose to reduce the number of read syscalls. See e.g. pytorch/pytorch#116536 (comment) for an explanation.
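For comparison, a sketch with a bigger buffer (1 MiB is an illustrative figure, not necessarily the constant the original code used):

```python
CHUNK_SIZE = 1024 * 1024  # 1 MiB per read: far fewer syscalls than 8 KiB chunks

async def write_stream(resp, temp_path, mode):
    with temp_path.open(mode) as f:
        async for chunk in resp.aiter_bytes(CHUNK_SIZE):
            f.write(chunk)
```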
```diff
 async def download(*, registry: str, name: str, version: str, dest_dir: str):
     with Progress() as progress:
-        async with httpx.AsyncClient() as client:
+        async with httpx.AsyncClient(timeout=httpx.Timeout(None)) as client:
```
I think there should be some timeout.
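One possible middle ground, with illustrative values: in httpx the `read` timeout bounds the silence between received chunks rather than the total transfer time, so finite timeouts need not break long but healthy downloads:

```python
import httpx

# With no default positional value, all four timeouts must be specified.
timeout = httpx.Timeout(connect=10.0, read=60.0, write=60.0, pool=10.0)
client = httpx.AsyncClient(timeout=timeout)
```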
Note that I was getting errors for models such as qwen2.5:14b, and any other large model. Here are the changes we made:
Updated format_size function:
Now it can handle sizes in GB, which is necessary for files larger than 4GB.
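Roughly along these lines (the exact thresholds and formatting are assumptions, not a verbatim copy of the PR):

```python
def format_size(size: float) -> str:
    # Walk up the units so multi-gigabyte blobs render sensibly instead
    # of overflowing a formatter that stops at MB.
    for unit in ("B", "KB", "MB", "GB", "TB"):
        if size < 1024 or unit == "TB":
            return f"{size:.2f} {unit}"
        size /= 1024
```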
New download_with_resume function:
This function handles the actual downloading, including the ability to resume partial downloads. It uses a temporary file and renames it only after a successful download.
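The promote-on-success step might look like this (`finalize_download` is a hypothetical name; `os.replace` is atomic on a single filesystem):

```python
import os
from pathlib import Path

def finalize_download(temp_path: Path, dest_path: Path, expected_size: int) -> None:
    # Only promote the temp file once it is demonstrably complete, so
    # readers never observe a half-written blob at dest_path.
    if temp_path.stat().st_size != expected_size:
        raise IOError(f"incomplete download: {temp_path}")
    os.replace(temp_path, dest_path)
```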
Improved download_blob function:
Increased max_retries to 5 by default.
Implemented exponential backoff for retries.
Uses the new download_with_resume function.
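Putting those three together, the retry loop might look roughly like this (signatures are assumptions; `download_with_resume` is the PR's new function):

```python
import asyncio
import httpx

async def download_blob(url: str, dest_path, *, max_retries: int = 5) -> None:
    for attempt in range(max_retries):
        try:
            # Resuming means a retry does not restart a multi-gigabyte
            # transfer from zero.
            await download_with_resume(url, dest_path)
            return
        except (httpx.HTTPError, OSError):
            if attempt == max_retries - 1:
                raise
            await asyncio.sleep(2 ** attempt)  # exponential backoff: 1s, 2s, 4s...
```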
Timeout adjustment:
In the download function, we set timeout=httpx.Timeout(None) for the httpx client, allowing for very long downloads without timing out.
Chunk size adjustment:
In the download_with_resume function, we use a chunk size of 8192 bytes (8KB) to read the response. This smaller chunk size allows for more frequent progress updates and potentially better memory management for very large files.