increase the size of uvm memory allocation #11

0x5459 · 2024-09-17T11:00:01Z

1<<16 is a very small number. I request to increase this limit. Perhaps we can discuss whether this limit can be removed.

From `1<<16` to `1<<32`

francescolavra · 2024-09-18T08:26:40Z

kernel-open/nvidia-uvm/uvm_kvmalloc.c

@@ -257,7 +257,7 @@ static void *alloc_internal(size_t size, bool zero_memory)
    // Make sure that (sizeof(hdr) + size) is what it should be
    BUILD_BUG_ON(sizeof(uvm_vmalloc_hdr_t) != offsetof(uvm_vmalloc_hdr_t *, ptr));

-    assert(size <= (1 << 16));
+    assert(size <= (1 << 32));


The (1 << 16) limit is there because the kernel heap used for these allocations supports a malloc-style interface (where you don't need to specify a size value when freeing memory) only for allocations sizes up to (1 << 16) (see the MAX_MCACHE_ORDER constant in the Nanos source at src/config.h). So it cannot be changed to a larger value.
If we want to remove this limit, we need to be able to use the vmalloc API when allocations larger that (1 << MAX_MCACHE_ORDER) are requested. This would entail:

changing the UVM_KMALLOC_THRESHOLD definition in kernel_open/nvidia_uvm/uvm_kvmalloc.h from infinity to (1 << MAX_MCACHE_ORDER)

changing the is_vmalloc_addr() definition in kernel_open/common/inc/nv_nanos.h from false to (objcache_from_object(u64_from_pointer(p), PAGESIZE_2M) == INVALID_ADDRESS)

changing the vfree() definition in kernel_open/common/inc/nv_nanos.h from kfree to NV_KFREE; this means the macro will take an additional size parameter, for which the macro invocations can use the alloc_size value of the corresponding uvm_vmalloc_hdr_t struct (all vmalloc allocations use this header)

francescolavra

Before submitting changes for review, the resulting code should be tested. These changes haven't even been build-tested.
Also, since we are removing the size limit for alloc_internal(), the assert() there should be removed altogether.

kernel-open/nvidia-uvm/uvm_kvmalloc.h

0x5459 · 2024-11-13T07:23:46Z

kernel-open/nvidia/linux_nvswitch.c

@@ -711,7 +711,7 @@ _nvswitch_os_free

    if (is_vmalloc_addr(ptr))
    {
-        vfree(ptr);
+        vfree(ptr, -1ull);


@francescolavra I don't know how to call vfree here. Please guide me.

Should I export the uvm_vmalloc_hdr_t structure for reference in kernel-open/nvidia/linux_nvswitch.c?

…e, get alloc_size from ptr

francescolavra · 2024-11-15T09:29:40Z

kernel-open/common/inc/nv-nanos.h

+    hdr = container_of(p, uvm_vmalloc_hdr_t, ptr);  \
+    NV_KFREE(p, hdr->alloc_size);                   \
+} while (0)
+


This won't work if memory is allocated from outside the nvidia-uvm module, because it won't have the uvm_vmalloc_hdr_t header.
To deal with the vfree() call in linux_nvswitch.c, I would not use vmalloc/vfree at all in the nvswitch code; instead, I would use use another Nanos kernel heap, i.e. get_kernel_heaps()->malloc, which can take -1 as size argument when freeing memory. So, _nvswitch_os_malloc() will be as below:

void *ptr = allocate(get_kernel_heaps()->malloc, size); if (ptr == INVALID_ADDRESS) return NULL; return ptr;

and _nvswitch_os_free() will be as below:

if (!ptr) return; deallocate((get_kernel_heaps()->malloc), ptr, -1ull);

The vfree() macro will only be called from the nvidia-uvm module, where we can easily extract the size parameter from the uvm_vmalloc_hdr_t header.

francescolavra · 2024-11-15T09:30:38Z

kernel-open/nvidia-uvm/uvm_kvmalloc.h

+#include <config.h>
+#define _CONFIG_H_
+#endif
+


This is not necessary, the config.h header is included internally by other Nanos header files.

increase the size of uvm memory allocation

0de451b

From `1<<16` to `1<<32`

francescolavra reviewed Sep 18, 2024

View reviewed changes

Correctly set the value of UVM_KMALLOC_THRESHOLD

7632b7c

0x5459 force-pushed the increase-uvm-alloc-size branch from e2ab22b to 7632b7c Compare November 4, 2024 03:25

francescolavra requested changes Nov 9, 2024

View reviewed changes

kernel-open/nvidia-uvm/uvm_kvmalloc.h Outdated Show resolved Hide resolved

use MAX_MCACHE_ORDER instead of hard-coded

32b874e

0x5459 force-pushed the increase-uvm-alloc-size branch from 8660efa to 32b874e Compare November 13, 2024 06:37

Correctly call vfree

d340014

0x5459 commented Nov 13, 2024

View reviewed changes

Export the uvm_vmalloc_hdr_t structure to nv-nanos.h. And modify vfre…

4bdb132

…e, get alloc_size from ptr

0x5459 force-pushed the increase-uvm-alloc-size branch from 1c3ccab to 4bdb132 Compare November 13, 2024 09:32

francescolavra reviewed Nov 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

increase the size of uvm memory allocation #11

increase the size of uvm memory allocation #11

0x5459 commented Sep 17, 2024

francescolavra Sep 18, 2024

francescolavra left a comment

0x5459 Nov 13, 2024 •

edited

Loading

0x5459 Nov 13, 2024

francescolavra Nov 15, 2024

francescolavra Nov 15, 2024

increase the size of uvm memory allocation #11

Are you sure you want to change the base?

increase the size of uvm memory allocation #11

Conversation

0x5459 commented Sep 17, 2024

francescolavra Sep 18, 2024

Choose a reason for hiding this comment

francescolavra left a comment

Choose a reason for hiding this comment

0x5459 Nov 13, 2024 • edited Loading

Choose a reason for hiding this comment

0x5459 Nov 13, 2024

Choose a reason for hiding this comment

francescolavra Nov 15, 2024

Choose a reason for hiding this comment

francescolavra Nov 15, 2024

Choose a reason for hiding this comment

0x5459 Nov 13, 2024 •

edited

Loading