TY - GEN
T1 - ISO/OSI compliant network-on-chip implementation for CNN applications
AU - Malki, Suleyman
AU - Hansson, A
AU - Spaanenburg, Lambert
AU - Akesson, B
PY - 2005
Y1 - 2005
N2 - The paper investigates the potential for a packet switching network for real-time image processing by a Cellular Neural Network (CNN) implemented on a Field-Programmable Gate-Array (FPGA). The implementation of a CNN requires several parameter restrictions with respect to the universal concept. For instance, the number representation and the cloning template are often confined to respectively 8 bits and a neighborhood of 1. It has been shown that optimal (i.e. minimal level) CNN architectures as derived from a morphological specification of the desired operation lead to arbitrarily large templates. A subsequent transformation step can turn this into a sequence of smaller templates for a specified hardware platform. The existence of a generic platform that can already handle the universal CNN architecture for prototyping and verification eliminates this need for technology-driven performance degradation. The proposed packet switcher consists of a physical layer where the CNN nodal function is performed, a data-link layer where the nodal data are maintained, a network layer with the packet receiver and sender and the actual switch as element of the transport layer. This ISO/OSI compliant level-wise structure monitors the network parameters and autonomously adjusts for the size of the neighborhood. It separates the broadcast of the network variables from the actual computation, allowing each to be executed at its own speed. The concept is tested on a re-design of the ILVA architecture and has been shown to handle arbitrary neighborhoods and precision at a comparable size and speed (1 node per BlockRAM / multiplier module @220 MHz clock)
AB - The paper investigates the potential for a packet switching network for real-time image processing by a Cellular Neural Network (CNN) implemented on a Field-Programmable Gate-Array (FPGA). The implementation of a CNN requires several parameter restrictions with respect to the universal concept. For instance, the number representation and the cloning template are often confined to respectively 8 bits and a neighborhood of 1. It has been shown that optimal (i.e. minimal level) CNN architectures as derived from a morphological specification of the desired operation lead to arbitrarily large templates. A subsequent transformation step can turn this into a sequence of smaller templates for a specified hardware platform. The existence of a generic platform that can already handle the universal CNN architecture for prototyping and verification eliminates this need for technology-driven performance degradation. The proposed packet switcher consists of a physical layer where the CNN nodal function is performed, a data-link layer where the nodal data are maintained, a network layer with the packet receiver and sender and the actual switch as element of the transport layer. This ISO/OSI compliant level-wise structure monitors the network parameters and autonomously adjusts for the size of the neighborhood. It separates the broadcast of the network variables from the actual computation, allowing each to be executed at its own speed. The concept is tested on a re-design of the ILVA architecture and has been shown to handle arbitrary neighborhoods and precision at a comparable size and speed (1 node per BlockRAM / multiplier module @220 MHz clock)
KW - technology-driven performance degradation
KW - field-programmable gate-array
KW - cellular neural network
KW - real-time image processing
KW - network-on-chip implementation
KW - packet switching network
U2 - 10.1117/12.608534
DO - 10.1117/12.608534
M3 - Paper in conference proceeding
VL - 5839
SP - 341
EP - 352
BT - Proceedings of the SPIE - The International Society for Optical Engineering
PB - SPIE
T2 - Bioengineered and Bioinspired Systems II
Y2 - 9 May 2005
ER -