# Shared Virtual Memory for the SCC: bare metal programming for future many-core architectures

Pablo Reble

March 23, 2012







### Agenda

- Parallel Programming Concepts
- SCC
- MetalSVM
- SVM subsystem
- Application
- Demo







# Parallel Programming Concepts

• Message Passing (MPI)

- process parallelism
- explicit communication
- Shared Memory (OpenMP)
  - loop/thread parallelism
  - implicit communication
  - coherent memory required







### <u>MARC</u>

The Single-chip Cloud Computer experimental processor is a concept vehicle created by Intel Labs as a platform for many-core software research.

- stands for: Many-Core Application Research Community
- launched in 2010 by Intel
- intention: provide access to future processor architectures to a broader audience
- sponsored Symposium, twice a Year in Europe
- http://communities.intel.com/community/marc



Chair for Operating Systems



- presented in 1994
- 32 bit intel architecture
- 75-100 MHz
- 3.3 volt
- on-chip APIC
- multiprocessor capability
- instruction to invalidate cache-located tagged data: CLIINV











- presented in 1994
- 32 bit intel architecture
- 75-100 MHz 100 MHz-1 GHz
- 3.3 volt
- on-chip APIC
- multiprocessor capability
- instruction to invalidate cache-located tagged data: CLIINV



Chair for Operating Systems





- presented in 1994
- 32 bit intel architecture
- 75-100 MHz 100 MHz-1 GHz
- <del>3.3 volt</del> 0.66 V 1.16 V
- on-chip APIC
- multiprocessor capability
- instruction to invalidate cache-located tagged data: CLIINV



Chair for Operating Systems





- presented in 1994
- 32 bit intel architecture
- 75-100 MHz 100 MHz-1 GHz
- <del>3.3 volt</del> 0.66 V 1.16 V
- on-chip APIC
- multiprocessor capability
- instruction to invalidate cache-located tagged data: CLIINV





Shared Virtual Memory for the SCC:

March 23, 2012



#### SCC Environment







#### **SCC Environment**







**SCC Environment** 



state-of-the-art 2D mesh interconnect







#### **SCC Environment**







**SCC Environment** 



![](_page_12_Picture_4.jpeg)

![](_page_12_Picture_8.jpeg)

![](_page_13_Picture_0.jpeg)

#### **SCC Environment**

![](_page_13_Figure_3.jpeg)

6

March 23, 2012

![](_page_13_Picture_7.jpeg)

![](_page_14_Picture_0.jpeg)

### **Rocky Lake Processor**

![](_page_14_Picture_3.jpeg)

![](_page_14_Picture_7.jpeg)

![](_page_14_Picture_8.jpeg)

![](_page_15_Picture_0.jpeg)

### **Rocky Lake Platform**

![](_page_15_Picture_3.jpeg)

![](_page_15_Picture_8.jpeg)

![](_page_16_Picture_0.jpeg)

### **Default Configuration**

- SCC provides shared but not coherent memory
- Cluster-like programming environment
- Separate Linux booted on each core
- Shell-script to start processes
- RCCE ['roki] light-weight message passing library

![](_page_16_Picture_9.jpeg)

![](_page_16_Picture_12.jpeg)

![](_page_17_Picture_0.jpeg)

### **Default Configuration**

- SCC provides shared but not coherent memory
- Cluster-like programming environment
- Separate Linux booted on each core
- Shell-script to start processes
- RCCE ['roki] light-weight message passing library

![](_page_17_Picture_9.jpeg)

![](_page_17_Picture_12.jpeg)

![](_page_18_Picture_0.jpeg)

### **Default Configuration**

- SCC provides shared but not coherent memory
- Cluster-like programming environment
- Separate Linux booted on each core
- Shell-script to start processes
- RCCE ['roki] light-weight message passing library

![](_page_18_Picture_9.jpeg)

Chair for Operating Systems

![](_page_19_Picture_0.jpeg)

## <u>RCCE</u>

- light-weight communication environment
- local put, remote get approach
- uses MPB to realize blocking, synchronous message passing

![](_page_19_Figure_6.jpeg)

![](_page_19_Picture_8.jpeg)

![](_page_19_Picture_11.jpeg)

![](_page_20_Picture_0.jpeg)

## <u>RCCE</u>

- light-weight communication environment
- local put, remote get approach
- uses MPB to realize blocking, synchronous message passing

![](_page_20_Figure_6.jpeg)

10

![](_page_20_Picture_8.jpeg)

![](_page_20_Picture_10.jpeg)

![](_page_21_Picture_0.jpeg)

## <u>RCCE</u>

- light-weight communication environment
- local put, remote get approach
- uses MPB to realize blocking, synchronous message passing

![](_page_21_Figure_6.jpeg)

![](_page_21_Picture_11.jpeg)

![](_page_22_Picture_0.jpeg)

<u>SVM</u>

#### • VM?

- virtual address space of a process is mapped onto a physical address space
- almost all UNIX system implementations, including Linux, use demand paging to manage the allocation of physical memory
- SVM?
  - concept of a single address space shared by a number of processors
  - strategies to generate coherent but distributed memory

![](_page_22_Picture_10.jpeg)

![](_page_23_Picture_0.jpeg)

![](_page_23_Figure_2.jpeg)

Project Goal

![](_page_23_Picture_4.jpeg)

- work in progress: research grant by Intel Labs Braunschweig
- shared virtual memory for many-core systems
- bare-metal hypervisor based approach

![](_page_24_Picture_0.jpeg)

- work in progress: research grant by Intel Labs Braunschweig
- shared virtual memory for many-core systems
- bare-metal hypervisor based approach

![](_page_24_Figure_5.jpeg)

### First SVM Prototype

Chair for Operating Systems

![](_page_24_Picture_7.jpeg)

![](_page_25_Picture_0.jpeg)

### History

- tiny OS kernel for education: eduOS (since 2009)
- 1<sup>st</sup> MARC Symposium (Braunschweig 2010)
  - presented basic ideas to integrate an SVM system into a bare-metal Hypervisor
- 3<sup>rd</sup> MARC Symposium (Ettlingen 2011)
  - Comm. and Synch. Layer (Focus on HW Synch. Support and High Concurrency)
- 4<sup>th</sup> MARC Symposium (Potsdam 2012)
  - SVM Prototype and first application benchmark

![](_page_25_Picture_11.jpeg)

![](_page_26_Picture_0.jpeg)

### lguest

Lguest is a small x86 32-bit Linux hypervisor [...] serves as an excellent springboard for mastering the theory and practice of x86 virtualization [...] You should also be inspired to create your own hypervisor, using your own pets as logo. – Rusty Russell '07

![](_page_26_Picture_7.jpeg)

![](_page_26_Picture_8.jpeg)

![](_page_27_Picture_0.jpeg)

### lguest

Lguest is a small x86 32-bit Linux hypervisor [...] serves as an excellent springboard for mastering the theory and practice of x86 virtualization [...] You should also be inspired to create your own hypervisor, using your own pets as logo. – Rusty Russell '07

![](_page_27_Picture_4.jpeg)

![](_page_27_Picture_6.jpeg)

Chair for Operating Systems

![](_page_28_Picture_0.jpeg)

### Prototype

#### handles SVM related data:

- use write trough strategy
- enable Level 1 caching only
- tag related Cache-Lines as MPBT
- Consequences:
  - + use write combining buffer
  - + hardware support for invalidation
  - no use of Level 2 Cache

![](_page_28_Picture_12.jpeg)

![](_page_29_Picture_0.jpeg)

### **Memory Consistency Models**

#### • Strong Memory Consistency Model:

Exactly one owner per page with read/write permissions

16

- allow dynamical change of ownership
- use messages to handle change

![](_page_29_Picture_8.jpeg)

![](_page_29_Picture_10.jpeg)

![](_page_30_Picture_0.jpeg)

### **Memory Consistency Models**

- Strong Memory Consistency Model:
  - Exactly one owner per page with read/write permissions
  - allow dynamical change of ownership
  - use messages to handle change
- Lazy Release Consistency:
  - Application explicitly controls Consistency (svm\_barrier)

16

![](_page_30_Picture_10.jpeg)

![](_page_30_Picture_12.jpeg)

![](_page_31_Picture_0.jpeg)

- First steps to apply Shared Memory Programming on the SCC use a small subset of SMI:
  - svm\_alloc
  - svm\_flush
  - svm\_invalidate

![](_page_31_Picture_7.jpeg)

![](_page_31_Picture_10.jpeg)

![](_page_32_Picture_0.jpeg)

### Visualize Change of Ownership

![](_page_32_Figure_3.jpeg)

![](_page_32_Picture_7.jpeg)

![](_page_33_Picture_0.jpeg)

### Visualize Change of Ownership

![](_page_33_Figure_3.jpeg)

18

![](_page_33_Picture_4.jpeg)

![](_page_33_Picture_7.jpeg)

![](_page_34_Picture_0.jpeg)

### Visualize Change of Ownership

![](_page_34_Figure_3.jpeg)

18

![](_page_34_Picture_4.jpeg)

March 23, 2012

![](_page_35_Picture_0.jpeg)

### Visualize Change of Ownership

![](_page_35_Figure_3.jpeg)

18

![](_page_35_Picture_4.jpeg)

March 23, 2012

![](_page_36_Picture_0.jpeg)

### Visualize Change of Ownership

![](_page_36_Figure_3.jpeg)

18

![](_page_36_Picture_7.jpeg)

![](_page_37_Picture_0.jpeg)

### Visualize Change of Ownership

18

![](_page_37_Figure_3.jpeg)

Private offchip DRAM

![](_page_37_Figure_5.jpeg)

![](_page_37_Picture_6.jpeg)

![](_page_37_Picture_9.jpeg)

![](_page_38_Picture_0.jpeg)

- Stencil app (part of RCCE)
- Dirichlet Bounding Condition
- Solver: Jacobi Over Relaxation algorithm
- Synchronous behavior

Shared Memory: Barrier between iterations Message Passing: implicitly

- SCC Platform running with 533 MHz core and 800 MHz memory/mesh
- double precision

![](_page_38_Picture_10.jpeg)

![](_page_39_Picture_0.jpeg)

#### access pattern

![](_page_39_Figure_3.jpeg)

20

![](_page_39_Picture_7.jpeg)

![](_page_40_Picture_0.jpeg)

#### access pattern

![](_page_40_Figure_3.jpeg)

Shared Virtual Memory for the SCC:

20

![](_page_40_Picture_7.jpeg)

![](_page_41_Picture_0.jpeg)

### First Results

![](_page_41_Figure_3.jpeg)

**RWTH**AACHEN UNIVERSITY

- Five Stamp Stencil
  - ▶ Problem size 1024 × 512

![](_page_42_Picture_0.jpeg)

### First Results

![](_page_42_Figure_3.jpeg)

**RWTH**AACHEN UNIVERSITY

- Five Stamp Stencil
  - Problem size 1024 × 512

![](_page_43_Picture_0.jpeg)

### First Results

![](_page_43_Figure_3.jpeg)

RWTHAACHEN UNIVERSITY

- Five Stamp Stencil
  - ▶ Problem size 1024 × 512

![](_page_44_Picture_0.jpeg)

### First Results

![](_page_44_Figure_3.jpeg)

**RWITH**AACHEN UNIVERSITY

- Five Stamp Stencil
  - ▶ Problem size 1024 × 512

![](_page_45_Picture_0.jpeg)

# **Conclusion and Outlook**

### Conclusion

- First Prototype of MetalSVM is running
- Results are promising

#### <u>Outlook</u>

- Boot Linux on multiple hypervisor instances
- Connect two SCCs
- plan a release in 2012 metalsvm.org

![](_page_45_Picture_10.jpeg)

![](_page_45_Picture_13.jpeg)

![](_page_46_Picture_0.jpeg)

# **Conclusion and Outlook**

### **Conclusion**

- First Prototype of MetalSVM is running
- Results are promising

### <u>Outlook</u>

- Boot Linux on multiple hypervisor instances
- Connect two SCCs
- plan a release in 2012 metalsvm.org

![](_page_46_Picture_10.jpeg)

![](_page_46_Picture_13.jpeg)

![](_page_47_Picture_0.jpeg)

### Demo

![](_page_47_Picture_2.jpeg)

Shared Virtual Memory for the SCC:

![](_page_47_Picture_5.jpeg)

Chair for Operating Systems

![](_page_47_Picture_7.jpeg)