## Software Decomposition for Multicore Architectures

Ankit Jain, Ravi Shankar Florida Atlantic University

## Multicore Architectures: Software Reuse Challenge

- Reverse Engineer existing (legacy) software to port it on next generation Multicore Architectures
- Partition existing code over Multiple concurrent cores
- Transform existing Sequential Model based software to Concurrent Model based software

## Methodology

- Ten Steps uses
  - Bottom-up Annotation
  - Middle-Out Analysis
  - Top-Down Representation
- Maximizes Software Reuse
- Performs Concurrency Modeling and Performance Simulations
- Selects MA and decomposed software architecture to meet QoS requirements

## Results

• Analysis was performed on 2-Core and 4-Core architecture solutions for 2 Algorithms

| Algorithm 1                                                                                                                                                                                                                                                                              | Algorithm 2                                                                                                                                                                                                                                                                                                                    |
|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Properties:                                                                                                                                                                                                                                                                              | Properties:                                                                                                                                                                                                                                                                                                                    |
| <ol> <li>Master Processor – Processed CCD,<br/>CCDPP, CNTRL &amp; REDEYE + Slave<br/>Management</li> <li>Slave Processor(s) – Processed<br/>Parallelized CODEC</li> <li>Pre-Expanded Instructions are passed<br/>to the Slave. Leads to heavy<br/>communication over the Bus.</li> </ol> | <ol> <li>Master Processor – Slave Management</li> <li>Slave Processor(s) – Processed<br/>Parallelized CCD, CCDPP, CNTRL,<br/>REDEYE, CODEC</li> <li>Slaves expands the instructions locally<br/>based on code and data received.</li> <li>Significant reduction in usage of the Bus<br/>as compared to Algorithm 1.</li> </ol> |
| <b>Performance:</b><br>4-Core showed over 23% improvement<br>over 2-Core implementation                                                                                                                                                                                                  | Performance:<br>4-Core showed over 65% (near 3x)<br>improvement over 2-Core implementation.<br>Algorithm 2 also showed significant<br>reduction in total execution cost as<br>compared to Algorithm 1.                                                                                                                         |