Function Simulator

Anonymous published on 2024-08-23 included in Graph-Computing

The key class of function simulator is TopUnit. We can use TopUnit.CycleStep() to begin the execution of simulator.

In CycleStep(), we will call the Frontend() and Backend(), the former is used to update the many types of counters and the latter is used to execute instructions sequence.

In Backend(), we will call the following functions one by one

BackendCommit()
BackendMemory()
BackendExecute()
BackendSchedule()

Among all these functions, the third function BackendExecute() will execute all the instructions.

Debugging With GDB

Anonymous published on 2024-08-02 included in Project-Experience

How to launch a program

There are three ways to launch a program:

Program Loading

Anonymous published on 2024-08-02 included in Compile-Link

Before the program running, static loader or dynamic loader need to initialize the content of the progress stack which is stipulated by System V Application Binary Interface.

ELF File Format Analysis

Anonymous published on 2024-06-13 included in Compile-Link

What is ELF ?

ELF: Executable and Linkable Format, 可执行与可链接格式

Heterogeneous Compilation

Anonymous published on 2024-06-07 included in Compile-Link

与 CUDA Compilation 不同在于

The tools used in the CUDA compilation are all closed source except gcc, g++ etc., for example fatbinary and nvlink. We need to substitue these tools to tools in clang system.

Clang Offload Bundler is used to combined different code for different machine structurel.
Clang Offload Packager is used to embed device code into host code.
Clang Linker Wrapper is used to .

这里面最复杂的感觉是怎么处理链接关系，如果仅仅说代码嵌入，从 CUDA 的流程来看，在 cudafe1.cpp include stub.c 生成 .o 这一步中 host code 中就已经包含了 device code，如果仅仅说 embed 的话，这显然就已经完成了，但是为何 CUDA 还进行后续那么多步骤，因为这一步生成的 .o 显然是无法运行的，device code 都还只是一个 extern signal，还需要同 CUDA runtime library 进行链接，这个过程该怎么进行比较难想。

Several Methods for Obtaining Time

Anonymous published on 2024-06-04 included in HPC

We can reference this article

About the last method that using rdtsc assembly command to obtain time, there are some error prone points, we can reference to this article.

If you want to learn about the TST, please reference to this article.