I was trying to find where the clang compiler is writing out constant global data values, and didn’t manage to find it by code inspection. If I run ltrace (also tracing system calls), I see the point where the ELF object is written out:
1 2 3 4 5 6 7 8 9 10 11 12 13 14 | std::string::compare(std::string const&) const(0x7ffc8983a190, 0x1e32e60, 7, 254) = 5 std::string::compare(std::string const&) const(0x1e32e60, 0x7ffc8983a190, 7, 254) = 0xfffffffb std::string::compare(std::string const&) const(0x7ffc8983a190, 0x1e32e60, 7, 254) = 5 write@SYS(4, "\177ELF\002\001\001" , 848) = 848 lseek@SYS(4, 40, 0) = 40 write@SYS(4, "\220\001" , 8) = 8 lseek@SYS(4, 848, 0) = 848 lseek@SYS(4, 60, 0) = 60 write@SYS(4, "\a" , 2) = 2 lseek@SYS(4, 848, 0) = 848 std::basic_string<char, std::char_traits<char>, std::allocator<char> >::~basic_string()(0x1e2a2e0, 0x1e2a2e8, 0x1e27978, 0x1e27978) = 0 rt_sigprocmask@SYS(2, 0x7ffc8983bb58, 0x7ffc8983bad8, 8) = 0 close@SYS(4) = 0 rt_sigprocmask@SYS(2, 0x7ffc8983bad8, 0, 8) = 0 |
This is from running:
The -S is to display syscalls as well as library calls. To my suprise, this seems to show calls to libstdc++ library calls, but I’m not seeing much from clang itself, just:
clang::DiagnosticsEngine::DiagnosticsEngine clang::driver::ToolChain::getTargetAndModeFromProgramName llvm::cl::ExpandResponseFiles llvm::EnablePrettyStackTrace llvm::errs llvm::install_fatal_error_handler llvm::llvm_shutdown llvm::PrettyStackTraceEntry::PrettyStackTraceEntry llvm::PrettyStackTraceEntry::~PrettyStackTraceEntry llvm::raw_ostream::preferred_buffer_size llvm::raw_svector_ostream::write_impl llvm::remove_fatal_error_handler llvm::StringMapImpl::LookupBucketFor llvm::StringMapImpl::RehashTable llvm::sys::PrintStackTraceOnErrorSignal llvm::sys::Process::FixupStandardFileDescriptors llvm::sys::Process::GetArgumentVector llvm::TimerGroup::printAll
There’s got to be a heck of a lot more that the compiler is doing!? It turns out that ltrace doesn’t seem to trace out all the library function calls that lie in shared libraries (I’m using a shared library + split dwarf build of clang). The default output was a bit deceptive since I saw some shared lib calls, in particular the there were std::… calls (from libstc++.so) in the ltrace output. My conclusion seems to be that the tool is lying by default.
This can be confirmed by explicitly asking to see the functions from a specific shared lib. For example, if I call ltrace as:
1 2 3 4 5 6 | $ ltrace -S --demangle -e @libLLVMX86CodeGen.so \ /clang/be .b226a0a /bin/clang-3 .9 \ -cc1 \ -triple \ x86_64-unknown-linux-gnu \ ... |
Now I get ~68K calls to libLLVMX86CodeGen.so functions that didn’t show up in the default ltrace output! The ltrace tool won’t show me these by default (although the man page seems to suggest that it should), but if I narrow down what I’m looking through to a single shared lib, at least I can now examine the function calls in that shared lib.
On the SONAME
Note that the @lib….so name has to match the SONAME. For example if the shared libraries on disk were:
libLLVMX86CodeGen.so -> libLLVMX86CodeGen.so.3
libLLVMX86CodeGen.so.3 -> libLLVMX86CodeGen.so.3.9
libLLVMX86CodeGen.so.3.9 -> libLLVMX86CodeGen.so.3.9.0
would give you the name to use. This becomes relevant in clang 4.0 where the SONAME ends up with .so.4 instead of just .so (when building clang with shared libs instead of archive libs).