New debug draw extension for AABBs #900

keptsecret · 2025-07-08T08:45:47Z

No description provided.

include/nbl/ext/DebugDraw/CDrawAABB.h

include/nbl/ext/DebugDraw/builtin/hlsl/common.hlsl

devshgraphicsprogramming · 2025-07-11T20:09:35Z

include/nbl/ext/DebugDraw/builtin/hlsl/common.hlsl

+struct PSInput
+{
+    float32_t4 position : SV_Position;
+    float32_t4 color : TEXCOORD0;


why are you using HW attributes for color? the color is per-instance

ah these are Vx-Px inter-stage shenanigans, I'll allow, make sure you label color flat though

still not labelled as flat interpolation

include/nbl/ext/DebugDraw/builtin/hlsl/aabb_instances.fragment.hlsl

include/nbl/ext/DebugDraw/builtin/hlsl/aabb_instances.vertex.hlsl

include/nbl/ext/DebugDraw/CDrawAABB.h

src/nbl/ext/DebugDraw/CDrawAABB.cpp

devshgraphicsprogramming · 2025-09-16T14:21:29Z

include/nbl/ext/DebugDraw/CDrawAABB.h

+	    ~DrawAABB() override;
+
+    private:
+        static bool validateCreationParameters(SCreationParameters& params);


make it an operator bool() const of the creation params
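A minimal sketch of what this suggestion could look like, written as standalone C++ so it compiles without Nabla. The three member types are hypothetical stand-ins, and marking the operator `explicit` is a choice of this sketch rather than something the review asks for.

```cpp
#include <cassert>

// Hypothetical stand-ins for the real Nabla types, only so the sketch compiles on its own.
struct IAssetManager {};
struct IUtilities {};
struct IQueue {};

struct SCreationParameters
{
    IAssetManager* assetManager = nullptr;
    IUtilities* utilities = nullptr;
    IQueue* transfer = nullptr;

    // Validity becomes a property of the parameters themselves.
    // `explicit` still allows `if (params)` and `!params`, but blocks accidental integer conversions.
    explicit operator bool() const
    {
        return assetManager && utilities && transfer;
    }
};

// Hypothetical factory showing the call site; the real code would log and return nullptr on failure.
inline bool create(const SCreationParameters& params)
{
    if (!params)
        return false;
    return true;
}
```

With this shape the private static `validateCreationParameters` is no longer needed, and callers can test their parameters before handing them over.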

devshgraphicsprogramming · 2025-09-16T14:22:52Z

include/nbl/ext/DebugDraw/CDrawAABB.h

+
+        struct SCreationParameters : SCachedCreationParameters
+        {
+            video::IQueue* transfer = nullptr;


Add a comment that it's only needed to create the 24-element index buffer and isn't used for anything afterwards.

devshgraphicsprogramming · 2025-09-16T16:09:39Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+DrawAABB::DrawAABB(SCreationParameters&& params, core::smart_refctd_ptr<video::IGPUGraphicsPipeline> singlePipeline, smart_refctd_ptr<IGPUGraphicsPipeline> batchPipeline, smart_refctd_ptr<IGPUBuffer> indicesBuffer)
+    : m_cachedCreationParams(std::move(params)), m_singlePipeline(std::move(singlePipeline)), m_batchPipeline(std::move(batchPipeline)),
+    m_indicesBuffer(std::move(indicesBuffer))


Nitpick: you could roll all these parameters into a struct, so there's less typing as you move between create, new, the constructor and initialization.

devshgraphicsprogramming · 2025-09-16T16:10:54Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+#ifdef NBL_EMBED_BUILTIN_RESOURCES
+	auto archive = make_smart_refctd_ptr<builtin::CArchive>(smart_refctd_ptr(logger));
+	system->mount(smart_refctd_ptr(archive), archiveAlias.data());
+#else
+	auto NBL_EXTENSION_MOUNT_DIRECTORY_ENTRY = (path(_ARCHIVE_ABSOLUTE_ENTRY_PATH_) / NBL_ARCHIVE_ENTRY).make_preferred();
+	auto archive = make_smart_refctd_ptr<nbl::system::CMountDirectoryArchive>(std::move(NBL_EXTENSION_MOUNT_DIRECTORY_ENTRY), smart_refctd_ptr(logger), system);
+	system->mount(smart_refctd_ptr(archive), archiveAlias.data());
+#endif


yeah I'm starting to think that the HLSL includes can't be found because of a bad archive alias (esp in the non-embedded case)

Also, how do you protect against mounting the same archive twice?

devshgraphicsprogramming · 2025-09-16T16:11:49Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	const auto validation = std::to_array
+	({
+		std::make_pair(bool(creationParams.assetManager), "Invalid `creationParams.assetManager` is nullptr!"),
+		std::make_pair(bool(creationParams.assetManager->getSystem()), "Invalid `creationParams.assetManager->getSystem()` is nullptr!"),


this can be asserted

src/nbl/ext/DebugDraw/CDrawAABB.cpp

devshgraphicsprogramming · 2025-09-17T09:38:56Z

include/nbl/system/ISystem.h

        //
        virtual inline bool isDirectory(const system::path& p) const
        {
+            // TODO: fix bug, input "nbl/ext/DebugDraw/builtin/hlsl" -> returns true when no such dir is present in mounted stuff due to how it uses parent paths in a loop (goes up until it matches the "nbl" builtin archive and thinks it resolved the requested dir)


@AnastaZIuk open an issue about it

devshgraphicsprogramming · 2025-09-17T09:46:36Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	if (!system->exists(path(NBL_ARCHIVE_ENTRY) / "common.hlsl", {}))
+		mount(smart_refctd_ptr<ILogger>(params.utilities->getLogger()), system.get(), NBL_ARCHIVE_ENTRY);


Shouldn't this check be inside mount? https://github.com/Devsh-Graphics-Programming/Nabla/pull/900/files#r2353004514

actually yes it could be
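To illustrate moving the check inside mount, here is a small standalone sketch. `FakeSystem` is a hypothetical miniature of a virtual file system and does not reproduce the real `ISystem` API; probing for `common.hlsl` follows the snippet above.

```cpp
#include <cassert>
#include <set>
#include <string>

// Hypothetical miniature of a virtual file system, only for this sketch.
class FakeSystem
{
public:
    bool exists(const std::string& path) const { return m_files.count(path) != 0; }

    // Pretends to mount an archive that contains a single file.
    void mountArchive(const std::string& alias)
    {
        ++m_mountCount;
        m_files.insert(alias + "/common.hlsl");
    }

    int mountCount() const { return m_mountCount; }

private:
    std::set<std::string> m_files;
    int m_mountCount = 0;
};

// The existence check lives inside mount, so callers cannot forget it.
// Returns true if the archive is available after the call.
inline bool mount(FakeSystem& system, const std::string& archiveAlias)
{
    const std::string probe = archiveAlias + "/common.hlsl";
    if (system.exists(probe))
        return true; // already mounted, nothing to do
    system.mountArchive(archiveAlias);
    return system.exists(probe);
}
```

Calling `mount` twice with the same alias then mounts the archive only once, which also answers the earlier question about double mounting.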

devshgraphicsprogramming · 2025-09-17T09:47:35Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+// note we use archive entry explicitly for temporary compiler include search path & asset cwd to use keys directly
+constexpr std::string_view NBL_ARCHIVE_ENTRY = _ARCHIVE_ENTRY_KEY_;


where does this variable come from and does everything still build without embedded resources cmake option?

> where does this variable come

_ARCHIVE_ENTRY_KEY_ is a private define set with CMake, scoped to the lib's sources only.

> does everything still build without embedded resources cmake option?

It builds and runs.

devshgraphicsprogramming · 2025-09-17T09:49:27Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+			CHLSLCompiler::SOptions options = {};
+			options.stage = stage;
+			options.preprocessorOptions.sourceIdentifier = filePath;
+			options.preprocessorOptions.logger = params.utilities->getLogger();
+			options.preprocessorOptions.includeFinder = includeFinder.get();
+			shaderSrc = compiler->compileToSPIRV((const char*)shaderSrc->getContent()->getPointer(), options);
+
+			return params.utilities->getLogicalDevice()->compileShader({ shaderSrc.get() });


@AnastaZIuk can we make @YasInvolved use CMake to produce precompiled SPIR-V with NSC the right way in a future PR, so the shaders get covered by CI?

devshgraphicsprogramming · 2025-09-17T10:16:28Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	auto getRequiredAccessFlags = [&](const bitflag<IDeviceMemoryAllocation::E_MEMORY_PROPERTY_FLAGS>& properties)
+		{
+			bitflag<IDeviceMemoryAllocation::E_MAPPING_CPU_ACCESS_FLAGS> flags(IDeviceMemoryAllocation::EMCAF_NO_MAPPING_ACCESS);
+
+			if (properties.hasFlags(IDeviceMemoryAllocation::EMPF_HOST_READABLE_BIT))
+				flags |= IDeviceMemoryAllocation::EMCAF_READ;
+			if (properties.hasFlags(IDeviceMemoryAllocation::EMPF_HOST_WRITABLE_BIT))
+				flags |= IDeviceMemoryAllocation::EMCAF_WRITE;
+
+			return flags;
+		};


only check for write flag on the mapping
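A sketch of the reduced lambda as a free function. The enum values here are hypothetical and only the names echo the snippet above; the rationale, that the extension presumably only writes through the mapping, is an assumption of this sketch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical flag values; only the names mirror the snippet above.
enum MemoryPropertyFlags : uint32_t { HOST_READABLE_BIT = 1u << 0, HOST_WRITABLE_BIT = 1u << 1 };
enum MappingAccessFlags : uint32_t { NO_MAPPING_ACCESS = 0u, READ = 1u << 0, WRITE = 1u << 1 };

// Requests write access only, even when the memory also happens to be host readable.
inline uint32_t getRequiredAccessFlags(uint32_t properties)
{
    uint32_t flags = NO_MAPPING_ACCESS;
    if (properties & HOST_WRITABLE_BIT)
        flags |= WRITE;
    return flags;
}
```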

devshgraphicsprogramming · 2025-09-17T10:59:00Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	smart_refctd_ptr<IGPUBuffer> indicesBuffer;
+	params.utilities->createFilledDeviceLocalBufferOnDedMem(
+		SIntendedSubmitInfo{ .queue = params.transfer },
+		std::move(bufparams),
+		unitAABBIndices.data()
+	).move_into(indicesBuffer);


It would be slightly less complex (you wouldn't have to take the utilities in the creation params) if you just shot off a single-use command buffer with an updateBuffer command, since the update is less than 64 KB (you just need the usage flag on the buffer to allow it).

utilities is still used elsewhere to get the logical device and logger, though we could pass those into the creation params separately instead.
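For scale, here is a sketch of how small the index data is. The corner numbering (bit 0 selects x, bit 1 selects y, bit 2 selects z) and the 32-bit index type are assumptions of this sketch, not taken from the PR. The 65536-byte bound is the Vulkan limit on `vkCmdUpdateBuffer`.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Builds line-list indices for the 12 edges of a box.
// Assumed corner numbering: bit 0 of the index selects x, bit 1 selects y, bit 2 selects z.
constexpr std::array<uint32_t, 24> makeUnitAABBLineIndices()
{
    std::array<uint32_t, 24> indices{};
    std::size_t out = 0;
    for (uint32_t corner = 0; corner < 8; ++corner)
        for (uint32_t axis = 0; axis < 3; ++axis)
        {
            const uint32_t bit = 1u << axis;
            if (corner & bit)
                continue; // emit each edge once, starting from its lower corner
            indices[out++] = corner;
            indices[out++] = corner | bit;
        }
    return indices;
}

constexpr auto unitAABBIndices = makeUnitAABBLineIndices();

// vkCmdUpdateBuffer accepts at most 65536 bytes per call; 24 * 4 = 96 bytes fits easily.
static_assert(sizeof(unitAABBIndices) <= 65536, "index data must fit in one inline buffer update");
```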

devshgraphicsprogramming · 2025-09-17T10:59:54Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+		.offset = 0,
+		.size = sizeof(SPushConstants)
+	};
+	return device->createPipelineLayout({ &pcRange , 1 }, nullptr, nullptr, nullptr, nullptr);


Why not call createPipelineLayoutFromPCRange?

devshgraphicsprogramming · 2025-09-17T11:01:19Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+		singlePipeline = createPipeline(params, params.singlePipelineLayout.get(), "single.vertex.hlsl", "aabb_instances.fragment.hlsl");
+		if (!singlePipeline)
+		{
+			logger->log("Failed to create pipeline!", ILogger::ELL_ERROR);
+			return nullptr;
+		}
+	}
+
+	smart_refctd_ptr<IGPUGraphicsPipeline> batchPipeline = nullptr;
+	if (params.drawMode & ADM_DRAW_BATCH)
+	{
+		batchPipeline = createPipeline(params, params.batchPipelineLayout.get(), "aabb_instances.vertex.hlsl", "aabb_instances.fragment.hlsl");


shouldn't you default the layouts if they're missing?

devshgraphicsprogramming · 2025-09-17T11:01:49Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	SPushConstantRange pcRange = {
+		.stageFlags = IShader::E_SHADER_STAGE::ESS_VERTEX,
+		.offset = 0,
+		.size = sizeof(SPushConstants)


your createDefaultPipelineLayout needs to take an enum about what pipeline this is for, cause you use two different push constant structs
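One possible shape for this, sketched without Nabla. The member layouts of both push constant structs, the enum values, and the helper name `getDefaultPushConstantRange` are hypothetical; only the struct and enum names come from the PR snippets.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical push constant layouts; the real structs are defined by the extension.
struct SSinglePushConstants { float transform[16]; float color[4]; };
struct SPushConstants { uint64_t pInstanceBuffer; };

// Names follow the draw modes used in the PR; the values are assumptions.
enum DrawMode : uint8_t { ADM_DRAW_SINGLE = 0b01, ADM_DRAW_BATCH = 0b10 };

struct PushConstantRange { uint32_t offset; uint32_t size; };

// Picks the push constant range that matches the pipeline the layout is created for.
inline PushConstantRange getDefaultPushConstantRange(DrawMode mode)
{
    const std::size_t size = (mode == ADM_DRAW_SINGLE) ? sizeof(SSinglePushConstants) : sizeof(SPushConstants);
    return { 0u, static_cast<uint32_t>(size) };
}
```

The same helper could also back the defaulting asked for earlier: when a layout is missing from the creation params, build one from the range for that mode.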

devshgraphicsprogramming · 2025-09-17T11:15:48Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	return true;
+}
+
+hlsl::float32_t4x4 DrawAABB::getTransformFromAABB(const hlsl::shapes::AABB<3, float>& aabb)


this should return 3x4 not 4x4

devshgraphicsprogramming · 2025-09-17T11:16:44Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+bool DrawAABB::renderSingle(IGPUCommandBuffer* commandBuffer, const hlsl::shapes::AABB<3, float>& aabb, const hlsl::float32_t4& color, const hlsl::float32_t4x4& cameraMat)
+{
+	if (!(m_cachedCreationParams.drawMode & ADM_DRAW_SINGLE))
+	{
+		m_cachedCreationParams.utilities->getLogger()->log("DrawAABB has not been enabled for draw single!", ILogger::ELL_ERROR);
+		return false;
+	}
+
+	commandBuffer->bindGraphicsPipeline(m_singlePipeline.get());
+	commandBuffer->setLineWidth(1.f);


make a struct with command buffer pointer, camera viewProj matrix and line width to use for both functions
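A rough sketch of such a struct. `SDrawParameters` and `validate` are hypothetical names, and the two types above them are stand-ins so the example compiles on its own.

```cpp
#include <cassert>

// Hypothetical stand-ins so the sketch compiles without Nabla.
struct IGPUCommandBuffer {};
struct float32_t4x4 { float m[4][4]; };

// Everything both draw functions need, gathered in one place.
struct SDrawParameters
{
    IGPUCommandBuffer* commandBuffer = nullptr;
    float32_t4x4 cameraMat = {};
    float lineWidth = 1.f;
};

// Shared validation that both the single and the batched draw could run first.
inline bool validate(const SDrawParameters& params)
{
    return params.commandBuffer != nullptr && params.lineWidth > 0.f;
}
```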

devshgraphicsprogramming · 2025-09-17T11:17:15Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	SSinglePushConstants pc;
+
+	hlsl::float32_t4x4 instanceTransform = getTransformFromAABB(aabb);
+	pc.instance.transform = hlsl::mul(cameraMat, instanceTransform);


Use promoted_mul to multiply the 4x4 with the 3x4.
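The two comments about 3x4 transforms fit together roughly as sketched below. This is plain C++ with `std::array` standing in for the HLSL matrix types, and `promotedMul` is a hypothetical helper showing what a promoted multiply could compute, not Nabla's implementation. Mapping the unit cube [0,1]^3 onto the box is also an assumption.

```cpp
#include <array>
#include <cassert>

using float3 = std::array<float, 3>;
using float3x4 = std::array<std::array<float, 4>, 3>; // 3 rows, 4 columns
using float4x4 = std::array<std::array<float, 4>, 4>;

// Maps the unit cube [0,1]^3 onto the box: scale by the extent, translate by the minimum.
// An affine transform needs no fourth row, hence the 3x4 return type.
inline float3x4 getTransformFromAABB(const float3& minVx, const float3& maxVx)
{
    float3x4 t{};
    for (int i = 0; i < 3; ++i)
    {
        t[i][i] = maxVx[i] - minVx[i];
        t[i][3] = minVx[i];
    }
    return t;
}

// Multiplies a 4x4 by a 3x4 as if the 3x4 had an implicit last row of (0, 0, 0, 1).
inline float4x4 promotedMul(const float4x4& lhs, const float3x4& rhs)
{
    float4x4 out{};
    for (int r = 0; r < 4; ++r)
        for (int c = 0; c < 4; ++c)
        {
            float sum = (c == 3) ? lhs[r][3] : 0.f; // contribution of the implicit row
            for (int k = 0; k < 3; ++k)
                sum += lhs[r][k] * rhs[k][c];
            out[r][c] = sum;
        }
    return out;
}
```

Multiplying an identity camera matrix by the transform of a box from (1, 2, 3) to (3, 5, 7) yields scales 2, 3, 4 on the diagonal, the minimum corner in the last column, and a last row of (0, 0, 0, 1).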

devshgraphicsprogramming · 2025-09-17T11:19:14Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	std::vector<InstanceData> instances(aabbInstances.size());
+	for (uint32_t i = 0; i < aabbInstances.size(); i++)
+	{
+		auto& inst = instances[i];
+		inst = aabbInstances[i];
+		inst.transform = hlsl::mul(cameraMat, inst.transform);
+	}


can we skip having this temporary vector and do this loop as a lambda we call instead of the memcpy below?
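A sketch of the idea: transform each instance while writing it to the destination, instead of filling a temporary vector and copying it afterwards. `InstanceData` is reduced to a single float here so the example stays self-contained, and `writeInstances` is a hypothetical name.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical reduction of the real instance struct; a scalar stands in for the matrix.
struct InstanceData { float transform; };

// Writes transformed instances straight into the destination (for example mapped
// streaming memory), so no intermediate copy of the whole instance list is needed.
template <typename Transform>
inline void writeInstances(InstanceData* dst, const InstanceData* src, std::size_t count, Transform&& transform)
{
    for (std::size_t i = 0; i < count; ++i)
    {
        dst[i] = src[i];
        dst[i].transform = transform(src[i].transform);
    }
}
```

In the real code the callable would apply the camera matrix, and the function would be invoked once per batch with the destination pointer inside the streaming buffer.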

devshgraphicsprogramming · 2025-09-17T11:20:46Z

src/nbl/ext/DebugDraw/CDrawAABB.cpp

+	auto instancesIt = instances.begin();
+	const uint32_t instancesPerIter = streaming->getBuffer()->getSize() / sizeof(InstanceData);
+	using suballocator_t = core::LinearAddressAllocatorST<offset_t>;
+	while (instancesIt != instances.end())


you'll deadlock if the streaming buffer is not large enough, check aabbInstances.size()>instancesPerIter and return false

Isn't that why we loop below this? We update and draw AABBs in batches that fit the streaming buffer size. Or am I misunderstanding how the streaming buffer works?
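The batching described in the reply can be sketched in isolation as below. `planBatches` is a hypothetical helper that only models the arithmetic of splitting instances across a fixed-size buffer. It does not model the streaming allocator itself, so it does not settle whether the real loop can block.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Splits `instanceCount` instances into batches that each fit in `bufferSize` bytes.
// Returns the batch sizes, or an empty vector when not even one instance fits.
inline std::vector<std::size_t> planBatches(std::size_t instanceCount, std::size_t bufferSize, std::size_t instanceSize)
{
    std::vector<std::size_t> batches;
    const std::size_t instancesPerIter = bufferSize / instanceSize;
    if (instancesPerIter == 0)
        return batches; // a loop over batches of size zero would never make progress
    std::size_t remaining = instanceCount;
    while (remaining != 0)
    {
        const std::size_t batch = std::min(remaining, instancesPerIter);
        batches.push_back(batch);
        remaining -= batch;
    }
    return batches;
}
```

In this simplified model the loop always terminates when at least one instance fits, so the only case that needs an early return is a buffer smaller than a single `InstanceData`.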

…ing in params struct

…and batch

keptsecret added 8 commits July 1, 2025 11:04

Merge branch 'mesh_loaders' into new_debug_draw

865e606

latest example

ca86128

merge master, fix conflicts

0ae3da2

latest example

cd2ef95

merge master

fe55bd7

added debug draw aabb extension, moved from ex

98ccfb2

removed todos

a755514

support hlsl AABBs, also OBBs with transform

473592b