Questions on arm-gcc linking: .ARM.exidx and .relocate sections

Discussion:

(too old to reply)

pozz

2025-01-09 08:47:29 UTC

I studied the output files from a build process of Atmel Studio project
for SAMD20 MCU that is a Cortex-M0+.
The IDE uses arm-gcc compiler.

The strange thing I noticed was the last Flash address used: in lss it
is 2'06a4. Indeed lss file ends with:

00020694 <_fini>:
20694: b5f8 push {r3, r4, r5, r6, r7, lr}
20696: 46c0 nop ; (mov r8, r8)
20698: bcf8 pop {r3, r4, r5, r6, r7}
2069a: bc08 pop {r3}
2069c: 469e mov lr, r3
2069e: 4770 bx lr

000206a0 <__fini_array_start>:
206a0: 000008a5 .word 0x000008a5

However the map file emits additional 8 bytes in .ARM.exidx section:

*(.fini_array)
.fini_array 0x000206a0 0x4 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o
*(SORT(.fini_array.*))
0x000206a4 __fini_array_end = .
*crtbegin.o(.dtors)
*(EXCLUDE_FILE(*crtend.o) .dtors)
*(SORT(.dtors.*))
*crtend.o(.dtors)
0x000206a4 . = ALIGN (0x4)
0x000206a4 _efixed = .
[!provide] PROVIDE (__exidx_start, .)

.vfp11_veneer 0x000206a4 0x0
.vfp11_veneer 0x000206a4 0x0 linker stubs

.v4_bx 0x000206a4 0x0
.v4_bx 0x000206a4 0x0 linker stubs

.iplt 0x000206a4 0x0
.iplt 0x000206a4 0x0 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o

.ARM.exidx 0x000206a4 0x8
*(.ARM.exidx* .gnu.linkonce.armexidx.*)
.ARM.exidx 0x000206a4 0x8 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m\libgcc.a(_udivmoddi4.o)
[!provide] PROVIDE (__exidx_end, .)

.eh_frame 0x000206ac 0x0
.eh_frame 0x000206ac 0x0 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o

.rel.dyn 0x000206ac 0x0
.rel.iplt 0x000206ac 0x0 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o

.jcr 0x000206ac 0x0
.jcr 0x000206ac 0x0 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o

.igot.plt 0x000206ac 0x0
.igot.plt 0x000206ac 0x0 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m/crtbegin.o
0x000206ac . = ALIGN (0x4)
0x000206ac _etext = .

What is .ARM.exidx section? Its contents in my case is (from the .hex):

:0806A400149BFF7F0100000020

However I noticed another strange thing. The hex file that I use for
production doesn't end at address 2'06AC, but at address 2'08CC. There
are other 0x220=544 bytes.

After exploring the output files I found the .relocate sections in map
file. It seems it is linked to RAM (0x2000'0000):

.relocate 0x20000000 0x220 load address 0x000206ac
0x20000000 . = ALIGN (0x4)
0x20000000 _srelocate = .
*(.ramfunc .ramfunc.*)
*(.data .data.*)
.data.memset_func
0x20000000 0x4 src/mbedtls/library/platform_util.o
.data.g_interrupt_enabled
0x20000004 0x1
src/ports/samd20/ASF/common/utils/interrupt/interrupt_sam_nvic.o
0x20000004 g_interrupt_enabled
*fill* 0x20000005 0x3
.data.tzinfo 0x20000008 0x40 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/../../../../arm-none-eabi/lib/thumb/v6-m\libc_nano.a(lib_a-gettzinfo.o)
.data._impure_ptr
0x20000048 0x4 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/../../../../arm-none-eabi/lib/thumb/v6-m\libc_nano.a(lib_a-impure.o)
0x20000048 _impure_ptr
.data.impure_data
0x2000004c 0x60 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/../../../../arm-none-eabi/lib/thumb/v6-m\libc_nano.a(lib_a-impure.o)
.data.__global_locale
0x200000ac 0x16c c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/../../../../arm-none-eabi/lib/thumb/v6-m\libc_nano.a(lib_a-locale.o)
0x200000ac __global_locale
.data._tzname 0x20000218 0x8 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/../../../../arm-none-eabi/lib/thumb/v6-m\libc_nano.a(lib_a-tzvars.o)
0x20000218 _tzname
0x20000220 . = ALIGN (0x4)
0x20000220 _erelocate = .

.bss 0x20000220 0x611c load address 0x000208d0
0x20000220 . = ALIGN (0x4)
0x20000220 _sbss = .
0x20000220 _szero = .

However I think it is linked in Flash and copied in RAM during startup code.

From what I understand, they are global/static variables initialized in
the declaration (with a startup value).

David Brown

2025-01-09 12:44:42 UTC

Permalink

Post by pozz
I studied the output files from a build process of Atmel Studio project
for SAMD20 MCU that is a Cortex-M0+.
The IDE uses arm-gcc compiler.
The strange thing I noticed was the last Flash address used: in lss it
.ARM.exidx      0x000206a4        0x8
*(.ARM.exidx* .gnu.linkonce.armexidx.*)
.ARM.exidx     0x000206a4        0x8 c:/program files
(x86)/atmel/studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-eabi/6.3.1/thumb/v6-m\libgcc.a(_udivmoddi4.o)
                [!provide]                PROVIDE (__exidx_end, .)

This shows that the ".ARM.exidx" section is being pulled in by the code
for the "_udivmoddi4.o" object file in libgcc.a. _udivmoddi4 is a
function that does division and modulo of 64-bit unsigned integers (on
targets that don't have a matching hardware instruction). But since it
is a "linkonce" section, it could also be pulled in by many other
functions - "linkonce" sections get merged automatically.

My understanding is that this section and the following few bytes are
required for stack unwinding for C++ exceptions. Even if you are not
using C++, or using it with exceptions disabled, there is still a very
small amount of such data generated and included in the C library
builds, because someone might call these functions in combination with
C++ exceptions. It is, I would say, too small to worry about in all but
the tightest memory situations.

Post by pozz
However I noticed another strange thing. The hex file that I use for
production doesn't end at address 2'06AC, but at address 2'08CC. There
are other 0x220=544 bytes.
After exploring the output files I found the .relocate sections in map
.relocate       0x20000000      0x220 load address 0x000206ac
                0x20000000                . = ALIGN (0x4)
                0x20000000                _srelocate = .
*(.ramfunc .ramfunc.*)
*(.data .data.*)
.data.memset_func
                0x20000000        0x4 src/mbedtls/library/platform_util.o
.data.g_interrupt_enabled
                0x20000004        0x1
src/ports/samd20/ASF/common/utils/interrupt/interrupt_sam_nvic.o
                0x20000004                g_interrupt_enabled

<snip>

Post by pozz
                0x20000220                _erelocate = .
.bss            0x20000220     0x611c load address 0x000208d0
                0x20000220                . = ALIGN (0x4)
                0x20000220                _sbss = .
                0x20000220                _szero = .
However I think it is linked in Flash and copied in RAM during startup code.

Yes.

Post by pozz
From what I understand, they are global/static variables initialized in
the declaration (with a startup value).

Yes, that is exactly what it is.

Uninitialised file-scope and static data in C goes in the ".bss"
section, linked to ram. There is code in the crt.o file (or another
startup file) that clears the .bss to zero.

Initialised file-scope and static data goes in the ".data" section.
This is linked to ram (i.e., the addresses of the variables are in ram)
but there is also a copy in flash with the initialisation data. The
pre-main startup code copies the data from flash to the .data section.

It is also possible to link functions to ram - they are copied across in
the same way (that's the ".ramfunc" section mentioned in your map file).
You might do this for speed-critical code on a microcontroller with
slow flash, or for functions used by flash programming routines.

pozz

2025-01-09 21:28:00 UTC

Permalink

Post by David Brown

Post by pozz
I studied the output files from a build process of Atmel Studio
project for SAMD20 MCU that is a Cortex-M0+.
The IDE uses arm-gcc compiler.
The strange thing I noticed was the last Flash address used: in lss it
.ARM.exidx      0x000206a4        0x8
  *(.ARM.exidx* .gnu.linkonce.armexidx.*)
  .ARM.exidx     0x000206a4        0x8 c:/program files (x86)/atmel/
studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-
eabi/6.3.1/thumb/v6-m\libgcc.a(_udivmoddi4.o)
                 [!provide]                PROVIDE (__exidx_end, .)

Ok, but what are those 8 bytes? Code? Values? It is strange there aren't
any info in lss.

Post by David Brown
My understanding is that this section and the following few bytes are
required for stack unwinding for C++ exceptions. Even if you are not
using C++, or using it with exceptions disabled, there is still a very
small amount of such data generated and included in the C library
builds, because someone might call these functions in combination with
C++ exceptions.

Thanks for the explanation that I take as is, without fully
understanding :-)

Post by David Brown
It is, I would say, too small to worry about in all but
the tightest memory situations.

Of course, yes. My question was "What is it?", not "How to save these 8
bytes?" if I don't need it.

Post by David Brown

Post by pozz
However I noticed another strange thing. The hex file that I use for
production doesn't end at address 2'06AC, but at address 2'08CC. There
are other 0x220=544 bytes.
After exploring the output files I found the .relocate sections in map
.relocate       0x20000000      0x220 load address 0x000206ac
                 0x20000000                . = ALIGN (0x4)
                 0x20000000                _srelocate = .
  *(.ramfunc .ramfunc.*)
  *(.data .data.*)
  .data.memset_func
                 0x20000000        0x4 src/mbedtls/library/
platform_util.o
  .data.g_interrupt_enabled
                 0x20000004        0x1 src/ports/samd20/ASF/common/
utils/interrupt/interrupt_sam_nvic.o
                 0x20000004                g_interrupt_enabled

<snip>

Post by pozz
                 0x20000220                _erelocate = .
.bss            0x20000220     0x611c load address 0x000208d0
                 0x20000220                . = ALIGN (0x4)
                 0x20000220                _sbss = .
                 0x20000220                _szero = .
However I think it is linked in Flash and copied in RAM during startup code.

Yes.

Post by pozz
From what I understand, they are global/static variables initialized
in the declaration (with a startup value).

I expected to see the non volatile copy in Flash of .relocate in the map
file. However only the copy in RAM is shown.

Post by David Brown
It is also possible to link functions to ram - they are copied across in
the same way (that's the ".ramfunc" section mentioned in your map file).
You might do this for speed-critical code on a microcontroller with
slow flash, or for functions used by flash programming routines.

David Brown

2025-01-10 10:03:11 UTC

Permalink

Post by pozz

Post by David Brown

Post by pozz
I studied the output files from a build process of Atmel Studio
project for SAMD20 MCU that is a Cortex-M0+.
The IDE uses arm-gcc compiler.
The strange thing I noticed was the last Flash address used: in lss
.ARM.exidx      0x000206a4        0x8
  *(.ARM.exidx* .gnu.linkonce.armexidx.*)
  .ARM.exidx     0x000206a4        0x8 c:/program files (x86)/atmel/
studio/7.0/toolchain/arm/arm-gnu-toolchain/bin/../lib/gcc/arm-none-
eabi/6.3.1/thumb/v6-m\libgcc.a(_udivmoddi4.o)
                 [!provide]                PROVIDE (__exidx_end, .)

Ok, but what are those 8 bytes? Code? Values? It is strange there aren't
any info in lss.

Unfortunately, if you want an answer to that, you need to dig into the
murky depths of how exception processing and stack unwinding are done.
It's complicated, it will involve a great deal of effort searching,
reading, experimenting, and analysing. And you'll learn pretty much
nothing of use unless you are thinking of making your own C++ compiler
from scratch - it's not even particularly useful if you are using C++
and have exceptions enabled. (Looking at the size of the sections might
be of interest to see the overhead exceptions have on code size.)

It would be nice if I could give you a clearer answer, or point you to a
simple explanation online, but I'm afraid I can't. And while I don't
know everything about this kind of thing, I know more than most - I have
an unhealthy interest in the details of toolchain. So if I can't give
you a full answer, you are probably just going to have to accept it as
an unexplained mystery unless you want to do a lot of googling. (If
someone else here actually knows more useful details, please let us know!)

Post by pozz

Thanks for the explanation that I take as is, without fully
understanding :-)

Most C functions are "transparent" to C++ exceptions. That is, if a C++
function "foo" has a try-catch block and calls the C function "bar"
which in turn calls the C++ function "foobar" which throws an exception,
then the throw handling will normally jump straight back to "foo" and
skip "bar" entirely.

But there are a few things that can complicate the process. gcc
extensions such as cleanup functions can be used in C code and must be
"unwound" like C++ destructors. setjmp/longjmp make a mess of
everything (as they always do). And some other mixes of C and C++
functions can be a little more complex.

Thus you end up with a small amount of data for stack unwinding and C++
exception handling even for C code, so that you can link that C code
with C++ code and use it freely.

Post by pozz

Post by David Brown
It is, I would say, too small to worry about in all but the tightest
memory situations.

Of course, yes. My question was "What is it?", not "How to save these 8
bytes?" if I don't need it.

Good, because that would be a much harder question to answer well!

Post by pozz

Post by David Brown

Post by pozz
However I think it is linked in Flash and copied in RAM during startup code.

Yes.

Post by pozz
From what I understand, they are global/static variables initialized
in the declaration (with a startup value).

I expected to see the non volatile copy in Flash of .relocate in the map
file. However only the copy in RAM is shown.

The map file shows the symbols - and the symbols are all in ram. The
only bits you see in the source copy in flash are for the start and end
of the block to copy.