Enable D-Cache for Cortex-M7 #485

salkinium · 2020-09-29T18:33:28Z

The Data Cache for Cortex-M7 devices is currently disables, due to a lack of in-depth understanding of cache policies during porting. The Instruction Cache however is enabled, since it only gets read.

See: https://github.com/modm-io/modm/blob/develop/src/modm/platform/core/cortex/startup.c.in#L95-L98

In addition to enabling the cache, the caches must be invalidated manually on certain operations (writing Flash for example).
This is however not just an issue on Cortex-M7, since most Cortex-M devices have some sort of vendor specific cache implementation for their Flash reads, which must also be manually invalided.

cc @mikewolfram

mikewolfram · 2020-09-29T18:50:11Z

I had this already on my list since I was wondering why the I-Cache is enabled, but not the D-Cache.

First time I saw this when porting the FreeRTOS HAL based network interface, but at least for the M7 the macro does nothing to invalidate the cache.

salkinium · 2020-09-29T19:06:49Z

Yes, I just remembered this due to getting confused about DTCM with D-Cache while fixing the DTCM size bug. I wasn't sure during porting what the invalidation required from an modm API perspective, so I just disabled it. Not great, not terrible.

salkinium · 2020-11-06T17:10:11Z

Leaving this here for my future self:
https://blog.feabhas.com/2020/10/introduction-to-the-arm-cortex-m7-cache-part-1-cache-basics/

salkinium · 2021-06-22T07:48:57Z

Also good: https://alexkalmuk.medium.com/cpu-caches-with-examples-for-arm-cortex-m-2c05a339246e

We should enable the D-Cache at least with write-through. Write-back can then be enabled by the user if required.

salkinium · 2021-06-22T07:57:36Z

Disadvantages of the ‘write-through’ mode are the following:

Sequential and frequent access to the same memory address can degrade performance.

You still need to do a cache invalidate after the end of DMA operations.

There is the “Data corruption in a sequence of Write-Through stores and loads” bug in some versions of Cortex-M7

Well… maybe not. Sounds like we would need support from the DMA API.

mikewolfram · 2021-06-23T08:19:07Z

I recently had an issue on a F765, where I enabled the D cache and Ethernet stopped working. Probably related to the DMA.

salkinium · 2021-06-23T08:55:52Z

Yeah, it seems to require application support to manually invalidate the cache when required (ie. after (during?) DMA transfers), and tbh the user can still enable the DCache in main() so there's no need for modm to enable it (wrongly).

I'm instead going to add some docs on this fact to the modm:platform:cortex-m module and just delegate this to the future.

ghost · 2022-02-03T21:59:19Z

Yeah, it seems to require application support to manually invalidate the cache when required (ie. after (during?) DMA transfers), and tbh the user can still enable the DCache in main() so there's no need for modm to enable it (wrongly).

There is a better solution, an MPU is really good to define cache policies. It gives better performance since you don't have to invalidate the cache, it only costs a new section in the linker script. You can see a example below on a helper class I have made to build MPU configurations. It's made in Boost style syntax but it is pretty easy to port. If you wish i can send it to you.

void mpu_setup()
{
    using namespace msl;

    auto dma_buffers = [] {
        mpu::region_builder<1, 0x38000000, 2048, mpu::region_type::normal> reg;
        reg.set_cache_policy(mpu::cache_policy::non_cacheable);
        reg.set_access_policy(mpu::access_policy::privileged_only);
        reg.update_rnr();
        reg.enable();
        return reg.build();
    };

    constexpr auto rdb = dma_buffers();
    mpu::update(rdb);

    mpu::enable(mpu::mode::default_memory_map);
}

salkinium · 2022-02-03T22:34:59Z

Oh, very nice, I hadn't thought about using the MPU that way!
I want to use the MPU to guard against fiber stack overflows, but this is also a very good use-case, so I'll happily have a look at your helper class please!

ghost · 2022-02-04T01:14:37Z

The header is posted here. I have modified it so it compiles in modm. 😃

The example below is tested on a stm32h743 and it should work on any cm4 and cm7 architecture, only the number of regions needs to be changed.

#include <modm/board.hpp>
#include <modm/processing.hpp>
#include <modm/platform/clock/rcc.hpp>

#include "mpu.hpp"

using namespace Board;

void mpu_setup()
{
    constexpr auto const stack_sentinal = [] {
        mpu::region_builder<0x24070000, 32, mpu::region_type::normal> reg;
        reg.set_cache_policy(mpu::cache_policy::write_back);
        reg.set_access_policy(mpu::access_policy::no_access);
        reg.update_rnr(0);
        reg.enable();
        return reg.build();
    }();
    mpu::update(stack_sentinal);

    constexpr auto const dma_buffers = [] {
        mpu::region_builder<0x38000000, 2048, mpu::region_type::normal> reg;
        reg.set_cache_policy(mpu::cache_policy::non_cacheable);
        reg.set_access_policy(mpu::access_policy::privileged_only);
        reg.update_rnr(1);
        reg.enable();
        return reg.build();
    }();
    mpu::update(dma_buffers);

    mpu::enable(mpu::mode::default_memory_map);
}

int
main()
{
	Board::initialize();
	Led::setOutput();

	RCC->AHB2ENR |= RCC_AHB2ENR_SRAM1EN_Msk | RCC_AHB2ENR_SRAM2EN_Msk | RCC_AHB2ENR_SRAM3EN_Msk;

	mpu_setup();

	[[maybe_unused]] volatile auto stack_crash = reinterpret_cast<std::uint32_t*>(0x24070000U);
	*stack_crash = 0;

ghost · 2022-02-04T10:01:58Z

I just discovered a bug. It should be fixed in the region_builder class, the MPU is very critical with correct alignment and can give some hairy situations if it's wrong.

static_assert((Address & 0x1f) == 0, "Invalid alignment");

// Should be changed to:
static_assert((Address & (Size - 1)) == 0, "Invalid alignment");

salkinium · 2022-02-04T22:30:39Z

I love it, this is great, do you want to add a modm:platform:mpu module?

We already have a modm_faststack section that we can align(32) to add a the space for the guard, this could be very handy for that. We will have to see if the region builder requires a simpler runtime version, since the specific stack object address is only known at link-time, so the constexpr won't work there, but I'm sure we can still make it efficient.

ghost · 2022-02-06T10:24:52Z

I love it, this is great, do you want to add a modm:platform:mpu module?

Yes, later... Right now missing SPI/DMA support is more important.

rleh · 2022-02-06T11:51:52Z

DMA and SPI with DMA support exists for all STM32 chips, see #371, #608, #629 and #772.

ghost · 2022-02-06T17:19:18Z

DMA and SPI with DMA support exists for all STM32 chips, see #371, #608, #629 and #772.

I can't find it?. It's advanced SPI on stm32h743vit6. 4 to 32 bits data frame, fifo buffers, etc...

rleh · 2022-02-06T18:01:19Z

Oh, I see. STM32H7 family completely is missing a SPI driver currently, see peripheral matrix. (Not sure if that is just not tested/enabled, but identical to the SPI peripherals in other STM32 families or if ST put a new SPI IP into the STM32H7 controllers which requires a new driver.)

For all other targets SpiMasterN and SpiMasterN_Dma HAL will be generated once you include the lbuild modulesmodm:platform:spi:N and modm:platform:dma.

e.g.: https://docs.modm.io/develop/api/stm32f745zgt7/

chris-durand · 2022-02-06T19:32:03Z

Oh, I see. STM32H7 family completely is missing a SPI driver currently, see peripheral matrix. (Not sure if that is just not tested/enabled, but identical to the SPI peripherals in other STM32 families or if ST put a new SPI IP into the STM32H7 controllers which requires a new driver.)

The H7 SPI has a completely different register map and needs a new driver.

salkinium added enhancement 🌈 help wanted 🛠 labels Sep 29, 2020

salkinium mentioned this issue Jan 29, 2021

[board] Nucleo-F767ZI support #542

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable D-Cache for Cortex-M7 #485

Enable D-Cache for Cortex-M7 #485

salkinium commented Sep 29, 2020

mikewolfram commented Sep 29, 2020

salkinium commented Sep 29, 2020

salkinium commented Nov 6, 2020

salkinium commented Jun 22, 2021

salkinium commented Jun 22, 2021

mikewolfram commented Jun 23, 2021

salkinium commented Jun 23, 2021

ghost commented Feb 3, 2022

salkinium commented Feb 3, 2022

ghost commented Feb 4, 2022

ghost commented Feb 4, 2022 •

edited by ghost

salkinium commented Feb 4, 2022

ghost commented Feb 6, 2022

rleh commented Feb 6, 2022

ghost commented Feb 6, 2022

rleh commented Feb 6, 2022

chris-durand commented Feb 6, 2022

Enable D-Cache for Cortex-M7 #485

Enable D-Cache for Cortex-M7 #485

Comments

salkinium commented Sep 29, 2020

mikewolfram commented Sep 29, 2020

salkinium commented Sep 29, 2020

salkinium commented Nov 6, 2020

salkinium commented Jun 22, 2021

salkinium commented Jun 22, 2021

mikewolfram commented Jun 23, 2021

salkinium commented Jun 23, 2021

ghost commented Feb 3, 2022

salkinium commented Feb 3, 2022

ghost commented Feb 4, 2022

ghost commented Feb 4, 2022 • edited by ghost

salkinium commented Feb 4, 2022

ghost commented Feb 6, 2022

rleh commented Feb 6, 2022

ghost commented Feb 6, 2022

rleh commented Feb 6, 2022

chris-durand commented Feb 6, 2022

ghost commented Feb 4, 2022 •

edited by ghost