A CNN-Specific Integrated Processor
Forskningsoutput: Tidskriftsbidrag › Artikel i vetenskaplig tidskrift
Integrated Processors (IP) are meant to supply algorithm-specific cores to a micro-electronic system. They are usually developed separately, and upon success turned into a part, that either by programming or by configuration can be re-used within many systems. This paper looks at architectures for Cellular Neural Networks (CNN) to become realized as IP. First current digital implementations are reviewed, and the memory-processor bandwidth issues are analyzed. Then a generic view is taken on the structure of the network, and a new intra-communication protocol based on rotating wheels is proposed. It is shown that this provides for guaranteed high-performance with a minimal network interfaces. The resulting node is small and supports multi-level CNN designs, giving the system a 30-fold increase in capacity compared to classical designs. As it facilitates multiple operations on a single image, and single operations on multiple images, with minimal access to the external image memory, balancing the internal and external data transfer requirements can optimize the system operation. Special consideration is given to the treatment of the boundary nodes. In conventional digital CNN designs, such requires additional logic to handle the CNN value propagation scheme. In the new architecture, only a slight modification of the existing cells is necessary to model the boundary effect. A typical prototype for visual pattern recognition will house 4096 CNN cells with a 2% overhead for making it an IP.
|Enheter & grupper|
Ämnesklassifikation (UKÄ) – OBLIGATORISK
|Tidskrift||Eurasip Journal on Advances in Signal Processing|
|Status||Published - 2009|
|Peer review utförd||Ja|