• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

A unified approach to mapping and routing on a network-on-chip for both best-effort and guaranteed service traffic (2007)

by A Hansson, K Goossens, A Rǎdulescu
Venue:Article ID 68432
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 12
Next 10 →

Outstanding Research Problems in NoC Design: Circuit-, Microarchitecture-, and System-Level Perspectives

by Radu Marculescu, Umit Y. Ogras, Li-shiuan Peh, Natalie Enright Jerger, Yatin Hoskote
"... Abstract—Networks-on-Chip (NoCs) have been recently proposed to replace global interconnects in order to alleviate complex communication problems. While several research problems concerning NoC design have been already addressed in the literature, many others remain to be solved. In this work, we fi ..."
Abstract - Cited by 52 (1 self) - Add to MetaCart
Abstract—Networks-on-Chip (NoCs) have been recently proposed to replace global interconnects in order to alleviate complex communication problems. While several research problems concerning NoC design have been already addressed in the literature, many others remain to be solved. In this work, we first provide a general description of NoC architectures and applications. Then, we enumerate several related research problems organized under five main categories: Application characterization, communication paradigm, communication infrastructure, analysis and solution evaluation. Motivation, problem formulation, proposed approaches and open issues are discussed for each problem enumerated in the paper from circuit, micro-architecture and systemlevel perspectives. Finally, we address the interactions among these research problems and put the NoC design process into perspective. Index terms — On-chip communication architecture, networks-onchip, multiprocessor system-on-chip, CMP. I.

A Statically Scheduled Time-Division-Multiplexed Network-on-Chip for Real-Time Systems

by Martin Schoeberl, Florian Br, Jens Sparsø, Evangelia Kasapaki
"... Abstract—This paper explores the design of a circuitswitched network-on-chip (NoC) based on time-divisionmultiplexing (TDM) for use in hard real-time systems. Previous work has primarily considered application-specific systems. The work presented here targets general-purpose hardware platforms. We c ..."
Abstract - Cited by 18 (8 self) - Add to MetaCart
Abstract—This paper explores the design of a circuitswitched network-on-chip (NoC) based on time-divisionmultiplexing (TDM) for use in hard real-time systems. Previous work has primarily considered application-specific systems. The work presented here targets general-purpose hardware platforms. We consider a system with IP-cores, where the TDM-NoC must provide directed virtual circuits – all with the same bandwidth – between all nodes. This may not be a frequent scenario, but a general platform should provide this capability, and it is an interesting point in the design space to study. The paper presents an FPGA-friendly hardware design, which is simple, fast, and consumes minimal resources. Furthermore, an algorithm to find minimum-period schedules for all-to-all virtual circuits on top of typical physical NoC topologies like 2D-mesh, torus, bidirectional torus, tree, and fat-tree is presented. The static schedule makes the NoC timepredictable and enables worst-case execution time analysis of communicating real-time tasks. Keywords-real-time systems; network-on-chip I.
(Show Context)

Citation Context

...The allocation here proceeds in two steps. First, routing paths are determined through the NoC depending on a mapping of an application to the network and the application’s communication requirements =-=[14]-=-. Given these paths, TDM time slots are allocated for each virtual circuit in turn [15]. This technique has been extended to split packets and deliver the individual fragments of the packet over multi...

Energy efficient application mapping to noc processing elements operating at multiple voltage levels

by Pavel Ghosh, Arunabha Sen - in Proceedings of the 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip , 2009
"... An efficient technique for mapping application tasks to heterogeneous processing elements (PEs) on a Network-on-Chip (NoC) platform, operating at multiple voltage levels, is presented in this paper. The goal of the mapping is to minimize energy consumption subject to the performance constraints. Suc ..."
Abstract - Cited by 7 (0 self) - Add to MetaCart
An efficient technique for mapping application tasks to heterogeneous processing elements (PEs) on a Network-on-Chip (NoC) platform, operating at multiple voltage levels, is presented in this paper. The goal of the mapping is to minimize energy consumption subject to the performance constraints. Such a mapping involves solving several subproblems. Most of the research effort in this area often address these subproblems in a sequential fashion or a subset of them. We take a unified approach to the problem without compromising the solution time and provide techniques for optimal and heuristic solutions. We prove that the voltage assignment component of the problem itself is NP-hard and is inapproximable within any constant factor. Our optimal solution utilizes a Mixed Integer Linear Program (MILP) formulation of the problem. The heuristic utilizes MILP relaxation and randomized rounding. Experimental results based on E3S benchmark applications and a few real applications show that our heuristic produces near-optimal solution in a fraction of time needed to find the optimal. 1
(Show Context)

Citation Context

...data paths, i.e., traffic movement on the NoC architecture. As consideration of all four subproblems simultaneously increases the complexity of the problem, most of the research effort in this domain =-=[4, 6, 8, 9]-=- either solve problems (i), (ii), (iii) and (iv) in a sequential fashion, or solve only a subset of them. To find an energy efficient application mapping, all four problems (i)-(iv) have to be solved....

Static Routing in Symmetric Real-Time Network-on-Chips

by Florian Brandner, Martin Schoeberl
"... With the rising number of cores on a single chip the question on how to organize the communication among those cores becomes more and more relevant. A common solution is to use a network-on-chip (NoC) that provides communication bandwidth, routing, and arbitration among the cores. The use of NoCs in ..."
Abstract - Cited by 5 (2 self) - Add to MetaCart
With the rising number of cores on a single chip the question on how to organize the communication among those cores becomes more and more relevant. A common solution is to use a network-on-chip (NoC) that provides communication bandwidth, routing, and arbitration among the cores. The use of NoCs in real-time systems is problematic, since the shared network and all cores connected to it have to be analyzed to derive time bounds of real-time tasks. We propose to use a statically scheduled, time-divisionmultiplexed NoC design that allows a decoupled analysis of individual real-time tasks. Our network provides virtual circuits between all cores. These virtual circuits are implemented by delivering messages periodically on a static, fixed routing schedule. Since the routing does not change, it can be pre-computed offline. This work focuses on the computation of routing schedules for symmetric NoC topologies, e.g., torus and hypercube. Due to the symmetry, the all-to-all communication can be modeled via simplified communication patterns that are concurrently processed by all routers. The scheduling problem is solved by a heuristic that tries to maximize the overlap of active patterns. Our experiments show that, for larger networks, our heuristic yields schedule lengths that are only 15 % to 20 % longer than theoretical lower bounds. Categories and Subject Descriptors C.3 [Special-Purpose and Application-Based Systems]: Real-time and embedded systems;
(Show Context)

Citation Context

...n requirements of a specific application. Along with the communication schedule, also the hardware is customized. For instance, buffers are often required to resolve conflicts on shared network links =-=[4, 9]-=-. This is problematic in safetycritical systems, as changes in the communication requirements then incur fundamental changes to the system architecture and hardware design. The approach followed by th...

Multi-path routing in time-divisionmultiplexed networks on chip,” in VLSI-SoC

by R Stefan, K Goossens , 2009
"... ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...mentation but with more relaxed constraints regarding timing. A solution for performing mapping, routing, slot allocation in the Æthereal networks, currently in used by the tool flow was presented in =-=[23]-=-. III. PROPOSED ALGORITHMS We propose and demonstrate a multi-path routing technique within the framework of Æthereal [17]. The Æthereal network offers a combination of packet and circuit switching me...

Run-time Mapping of Applications on FPGA-based Reconfigurable Systems

by Ivan Beretta, Vincenzo Rana, David Atienza, Donatella Sciuto
"... GAs) in System-on-Chip (SoC) design considerably increased in the last few years. Their established importance is due to the large amount of hardware resources they offer, as well as to their increasing performance, and furthermore to the support for reconfigurability. Even though FPGAs seem to have ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
GAs) in System-on-Chip (SoC) design considerably increased in the last few years. Their established importance is due to the large amount of hardware resources they offer, as well as to their increasing performance, and furthermore to the support for reconfigurability. Even though FPGAs seem to have reached their maturity, there is still a lack of Computer-Aided Design (CAD) tools able to deal with dynamic reconguration. Existing algorithms aim at optimizing the performance of a set of applications, basing the computation on classic metrics (such as communication overhead), while reconfiguration-related issues are not taken into consideration. This work proposes a design methodology to map several applications on the FPGA area at run-time. Starting from a basic solution found at design-time for the initial set of applications, the proposed algorithm makes it possible to map a new application (not known at design-time), both minimizing the number of synthesis processes and optimizing the on-chip performance of the new application. Experimental results show that the proposed approach is able to achieve up to a 18 % reduction in the number of reconfigurations with respect to an off-line static-mapping approach, while generally preserving the performance of the executed applications on the FPGA. I.
(Show Context)

Citation Context

...ce, in [5] the authors propose a mapping algorithm to optimize the communication latency between the cores, which can be applied to a promising communication paradigm called Network-on-Chip (NoC). In =-=[6]-=-, the authors combine mapping and routing into a unified problem, which they solve using an algorithm that minimizes the complexity of the network. Finally, in [7] an algorithm that minimizes area and...

A Mapping Flow for Dynamically Reconfigurable Multi-Core System-on-Chip Design

by Ivan Beretta, Vincenzo Rana, David Atienza, Donatella Sciuto
"... Abstract—Nowadays, multi-core systems-on-chip (SoCs) are typically required to execute multiple complex applications, which demand a large set of heterogeneous hardware cores with different sizes. In this context, the popularity of dynamically reconfigurable platforms is growing, as they increase th ..."
Abstract - Add to MetaCart
Abstract—Nowadays, multi-core systems-on-chip (SoCs) are typically required to execute multiple complex applications, which demand a large set of heterogeneous hardware cores with different sizes. In this context, the popularity of dynamically reconfigurable platforms is growing, as they increase the ability of the initial design to adapt to future modifications. This paper presents a design flow to efficiently map multiple multi-core applications on a dynamically reconfigurable SoC. The proposed methodology is tailored for a reconfigurable hardware architecture based on a flexible communication infrastructure, and exploits applications similarities to obtain an effective mapping. We also introduce a run-time mapper that is able to introduce new applications that were not known at design-time, preserving the mapping of the original system. We apply our design flow to a real-world multimedia case study and to a set of synthetic benchmarks, showing that it is actually able to extract similarities among the applications, as it achieves an average improvement of 29 % in terms of reconfiguration latency with respect to a communication-oriented approach, while preserving the same communication performance. Index Terms—Field programmable gate arrays, platformbased design, reconfigurable architectures, run-time adaptability. I.
(Show Context)

Citation Context

... to optimize the communication overhead of an application mapped onto a NoC. The algorithm in [12] performs mapping and path selection in order to minimize area requirements and power consumption. In =-=[13]-=-, the authors proposed a technique to evaluate and optimize core mapping, path selection, and time-slot assignment at the same time. However, all these approaches are proposed for a design-time scenar...

2009 International Conference on Microelectronics Run-Time Mapping for Dynamically-Added Applications in Reconfigurable Embedded Systems

by Ivan Beretta, Vincenzo Rana, David Atienza, Marco D. Santambrogio, Donatella Sciuto
"... Abstract — The increasing popularity of multi-core System-on-Chip platforms introduces new challenges, both in terms of hardware platforms and design methodologies. Dynamic reconfiguration can be exploited to increase the flexibility of the system and to implement multiple applications, since it is ..."
Abstract - Add to MetaCart
Abstract — The increasing popularity of multi-core System-on-Chip platforms introduces new challenges, both in terms of hardware platforms and design methodologies. Dynamic reconfiguration can be exploited to increase the flexibility of the system and to implement multiple applications, since it is possible to easily switch between them by reconfiguring part of the device at run-time. Additionally, new applications may be included in the system after the design time synthesis has been completed. This paper addresses the problem of mapping new applications on the device area at run-time, by reusing existing components of the system. We propose an heuristic technique that is able to determine how the new application should be mapped in a short time and, thanks to the reuse policy, to immediately deploy the solution on the device. The proposed algorithm also takes into consideration two conflicting performance metrics, in order to generate a good quality result. I.
(Show Context)

Citation Context

... for NoC architectures, which aim at minimizing other on-chip performance metrics. For instance, the algorithm proposed in [7] minimizes area requirements and power consumption, while the approach in =-=[8]-=- applies a unified mapping and routing algorithm to reduce the network complexity. A mapper that exploits dynamic reconfiguration to map multiple applications has been proposed in [9]. The aim of the ...

On the Capacity of Bufferless Networks-on-Chip

by Er Shpiner, Erez Kantor, Pu Li, Israel Cidon, Isaac Keslassy
"... Abstract—Networks-on-Chip (NoCs) form an emerging paradigm for communications within chips. In particular, bufferless NoCs require significantly less area and power consumption, but also pose novel major scheduling problems to achieve full capacity. In this paper, we provide first insights on the ca ..."
Abstract - Add to MetaCart
Abstract—Networks-on-Chip (NoCs) form an emerging paradigm for communications within chips. In particular, bufferless NoCs require significantly less area and power consumption, but also pose novel major scheduling problems to achieve full capacity. In this paper, we provide first insights on the capacity of bufferless NoCs. In particular, we present optimal periodic schedules for several bufferless NoCs with a completeexchange traffic pattern. These schedules particularly fit distributed-programming models and network congestioncontrol mechanisms. Finally, we analytically evaluate the performance of our scheduling algorithms. A. Background I.
(Show Context)

Citation Context

...tterns. In particular, the UMARS algorithm relies on an offline scheduling by ordering the flows by their bandwidth requirements prior to scheduling them greedily by shortest and less contented route =-=[32]-=-. Other works suggest algorithms for route optimization with multiple paths and shortest latency by keeping in-order arrivals in offline [33], [34] and online [35], [36] modes. However, all these work...

doi:10.1155/2010/603059 Research Article 3D Network-on-Chip Architectures Using Homogeneous

by Heterogeneous Floorplans, Vitor De Paulo, Cristinel Ababei , 2010
"... License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. We propose new 3D 2-layer and 3-layer NoC architectures that utilize homogeneous regular mesh networks on a separate layer and one or two heterogeneous floorplanning l ..."
Abstract - Add to MetaCart
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. We propose new 3D 2-layer and 3-layer NoC architectures that utilize homogeneous regular mesh networks on a separate layer and one or two heterogeneous floorplanning layers. These architectures combine the benefits of compact heterogeneous floorplans and of regular mesh networks. To demonstrate these benefits, a design methodology that integrates floorplanning, routers assignment, and cycle-accurate NoC simulation is proposed. The implementation of the NoC on a separate layer offers an additional area that may be utilized to improve the network performance by increasing the number of virtual channels, buffers size, or mesh size. Experimental results show that increasing the number of virtual channels rather than the buffers size has a higher impact on network performance. Increasing the mesh size can significantly improve the network performance under the assumption that the clock frequency is given by the length of the physical links. In addition, the 3-layer architecture can offer significantly better network performance compared to the 2-layer architecture. 1.
(Show Context)

Citation Context

...ling based floorplanning algorithm (the concept of unifying different design flow steps to better explore the design solution space has been applied successfully for example to mapping and routing in =-=[45]-=-.) However, this becomes computationally too expensive due to the long CPU runtimes required by the cycle-accurate simulator. 6. Experimental Results 6.1. Experimental Setup and Testcases. We implemen...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University