Flexibility, Speed and Accuracy in VLIW Architectures Simulation and Modeling


IVANO BARBIERI, MASSIMO BARIANI, ALBERTO CABITTO, MARCO RAGGIO
Department of Biophysical and Electronic Engineering
University of Genoa
Via Opera Pia 11A
ITALY

Abstract: - This document discusses the Instruction Set Architecture (ISA) simulation issue and shows the typical tradeoffs between Flexibility, Speed and Accuracy. Based on hypotheses about the architecture approach (VLIW) and the applications of interest (DSP and Multimedia), the article presents a solution that represents a challenging compromise in ISA simulation. A fast, accurate and flexible ISA simulation environment has been implemented using a Simulation Cache, an innovative three-dimensional pipeline status model, and a Simulation-Oriented Hw description. Results on two architecture case studies, TI TMS320C62x and ST200, are reported to validate the described approach.

Key Words: VLIW, ILP, Hw-Sw Co-design, DSP, Multimedia, Development Tools, Instruction Simulation, Simulation Speed, Simulation Accuracy.

1 Introduction
The growing computational power requirements of DSP and Multimedia applications, together with the need for easy-to-program development environments, have driven recent programmable devices toward VLIW (Very Long Instruction Word) [1] architectures and Hw-Sw co-design environments [2]. VLIW architectures allow optimized machine code to be generated from high-level languages by exploiting Instruction Level Parallelism (ILP) [3]. Furthermore, application requirements and time-to-market constraints are growing enormously, transferring functionality from Hardware to Software implementations [4] and moving developers toward System on Chip programmable devices. Application-driven System on Chip design [4] seems to be the answer to the complex requirements of DSP (Multimedia, telecommunication) applications.
2 VLIW Architectures and DSP Applications
In the previous chapter we introduced ILP to describe how VLIW architectures optimize code execution. ILP can be considered from two points of view: the ILP available in a region of code, and the ILP achievable on a given architecture (Hw-ILP) [5]. The software developer tries to write code whose ILP is as close as possible to the Hw-ILP. Conversely, the architecture designer analyzes typical application code in order to match the same Hw-ILP in the Hw design process. Hw-ILP solutions are:
- Multiple functional units executing at the same time
- Multiple copies of functional units accessing different register files
- Pipelining for functional units with latency longer than one cycle

Multimedia and, in general, digital signal processing applications typically have large available ILP [2], specific data access requirements, repetitive numeric calculation, numeric fidelity, high memory bandwidth and real-time processing requirements [6]. A number of general-purpose processors are suitable for DSP tasks. Nevertheless, DSP processors outperform general-purpose processors in cost-performance ratio and power consumption [7] [8]. One purpose of architecture-exploration tools is to allow designers to find the best match between available and achievable ILP through the interaction between code-development tools and architecture-parameter tuning tools (Application-driven Architectures). Examples of this interaction are modifications to the Instruction Set or to the Long Instruction Issue [9].

The approach described in chapter 4 has been used to simulate state-of-the-art VLIW architectures [10][11][12].

3 Instruction Simulation Tradeoffs: Speed, Accuracy and Flexibility
Instruction simulators are nowadays widely used in application-driven architecture design. The architecture design process is strongly influenced by the tradeoff between simulation speed and accuracy: cycle-accurate or close-to-cycle-accurate simulation usually has low performance [13] [14]. Better simulation performance can be obtained through vertical software optimization for a given target architecture, but in that case the tradeoff to take into account is between speed and flexibility. An interpretive, re-configurable Instruction Set Simulator could efficiently model a System on Chip (SoC) and the run-time interactions between heterogeneous SoC parts (core, co-processors, etc.). The VLIW-SIM ISS is intended to be a Sw application design support and architecture exploration tool.

4 VLIW-SIM: An Innovative Simulation Approach
The VLIW-SIM ISA simulation environment is composed of a set of modules implementing pipeline, memory, Register File, I/D Cache, Instruction and System I/O modeling. The implementation requirements were: an interpretive simulation approach, efficient host memory allocation, Instruction Set flexibility, step-by-step pipeline status tracking, simulation speed and accuracy. Further details on the VLIW-SIM environment can be found in [9] [15]. The following chapters (4.1 to 4.4) describe the main VLIW-SIM modeling approaches.

4.1 Pipeline Modeling
The pipeline status is represented in the simulation as a three-dimensional space (phase, operation, time) (Figure 1). The Phase axis represents the pipeline phase. The Operation axis represents the position of the instruction in the Long Instruction Word. The Time axis represents the given time stamp.
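A minimal sketch of this representation follows. It assumes four phases named F, D, R and W and a Long Instruction Word of four operations; both values, and the names `PHASES`, `WIDTH`, `empty_status`, `step` and `run`, are illustrative and are not taken from VLIW-SIM. Each time stamp is a two-dimensional (phase, operation) array, and the time axis is obtained by building a following array from the current one at every step. The processing that a real simulator performs in each phase is deliberately left out.

```python
PHASES = ["F", "D", "R", "W"]   # assumed phase names
WIDTH = 4                       # assumed operations per Long Instruction Word


def empty_status():
    """One time stamp: a (phase x operation) array of operation slots."""
    return [[None] * WIDTH for _ in PHASES]


def step(current, fetched):
    """Build the following status from the current one.

    Row 0 receives the newly fetched Long Instruction Word; every other
    row receives the operations that were one phase earlier. Operations
    in the last phase leave the pipeline and are returned as retired.
    """
    following = empty_status()
    following[0] = list(fetched)
    for phase in range(1, len(PHASES)):
        following[phase] = list(current[phase - 1])
    retired = [op for op in current[-1] if op is not None]
    return following, retired


def run(words):
    """Simulate the given words, then feed empty words to drain the pipeline."""
    status = empty_status()
    history = []    # the time axis: one 2-D status per time stamp
    retired = []
    drain = [[None] * WIDTH] * len(PHASES)
    for word in list(words) + drain:
        status, done = step(status, word)
        history.append(status)
        retired.extend(done)
    return history, retired
```

Running `run` on two Long Instruction Words produces six time stamps (two fetches plus four drain steps), and `history[t][p][o]` then addresses the operation at time `t`, phase `p` and position `o`, which is the three-dimensional view described above.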
[Figure 1 - Pipeline 3D Status for ST200. Axes: Phase (F, D, R, W), Operation, Time.]

The element of the pipeline status is the internal representation of an operation. The simulation process is based on two-dimensional arrays, each representing the pipeline status at a given time stamp. Status evolution uses two pipeline status arrays representing the current and the following status (Figure 2). The first row of the array contains the last-fetched Long Instruction Word. The pipeline status progression is based on the following algorithm: the instructions in a given pipeline phase of the current pipeline status are processed and transferred to the next pipeline phase of the following pipeline status. How the instructions are processed depends on the instruction type and on the phase they are in. After all the phases have been updated, the following pipeline status becomes the current one.

[Figure 2 - Pipeline Status Arrays for TI C62x. Two arrays, Current Status and Following Status, with one row per pipeline phase (PG, PS, PW, PR, DP, DC, E3, E4, E5 are legible) and the label "Load fetch packet n".]

4.2 Simulation Cache
Performance tests and code profiling have been performed on the simulator. The code profiling has shown that a major overhead is the instruction simulation; in particular, decode and dispatch are the two most frequently executed modules. Moreover, typical VLIW applications iterate over small pieces of code. This feature and VLIW static scheduling allow the concept of Simulation Locality to be introduced: large parts of the code are simulated iteratively, therefore some of the simulation internal data (e.g. decoded and dispatched instructions) can be re-used, saving simulation time. This idea results in the implementation of a Simulation Cache: a fast-access memory, based on spatial and temporal locality, containing the last N fetch packets together with their already-processed simulation data. The cache management algorithm detects hits and misses on cached fetch packets and replaces dismissible packets following a common block replacement algorithm. The simulation cache size and the cache block size are critical parameters for simulator performance. To significantly improve the simulated instructions per second (sips) rate, the simulation cache should be large enough to hold the code of the critical loops. Measurements on a wide set of multimedia applications have been performed to verify locality and to find the best-fitting cache and block (cache line) sizes. The VLIW application locality hypothesis and the measurements on typical multimedia applications (Figure 3) suggest that introducing the simulation cache in the simulation environment should bring significant improvements.

[Figure 3 - H.263+ Code Temporal and Spatial Locality. Series: tot acc, temp loc.]

4.3 Simulation-Oriented Hardware Description
In order to implement a set of tools capable of simulating a generic VLIW processor, the simulation environment should take a Hw description as input. The main purpose of this description is simulation accuracy; therefore it should focus only on those Hw aspects relevant to simulation: Decode Architecture (instruction decoding masks, position of the instruction fields in the codeword, field size and meaning), Branch Architecture, Pipeline phase description (duration, size, mnemonic phase name) and VLIW Parameterization.

Decode Architecture: From the description, a set of C-like macros is automatically created for each Instruction Mask, in order to extract the instruction fields. Moreover, each macro has to be associated with the proper entry in the Instruction Field Table (IFT).
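As a rough illustration of the Decode Architecture item above and of the Simulation Cache of section 4.2, the sketch below extracts instruction fields with mask-and-shift pairs and keeps the decoded form of recently fetched packets so that a loop body is decoded only once. Everything in it is hypothetical: the 32-bit field layout in `FIELDS` is not a real C62x or ST200 encoding, the names `decode` and `SimulationCache` are invented here, and least-recently-used replacement is an assumption, since the text only says that a common block replacement algorithm is used.

```python
from collections import OrderedDict

# Hypothetical 32-bit layout: field name -> (mask, shift).
FIELDS = {
    "opcode": (0xFC000000, 26),
    "dest":   (0x03E00000, 21),
    "src1":   (0x001F0000, 16),
    "src2":   (0x0000F800, 11),
}


def decode(word):
    """Extract every field of one instruction word with mask and shift."""
    return {name: (word & mask) >> shift
            for name, (mask, shift) in FIELDS.items()}


class SimulationCache:
    """Keeps the decoded form of the last `capacity` fetch packets.

    LRU replacement is an assumption made for this sketch.
    """

    def __init__(self, capacity):
        self.capacity = capacity
        self.packets = OrderedDict()   # packet address -> decoded operations
        self.hits = 0
        self.misses = 0

    def fetch(self, address, memory):
        """Return the decoded packet at `address`, decoding only on a miss."""
        if address in self.packets:
            self.hits += 1
            self.packets.move_to_end(address)   # mark as most recently used
            return self.packets[address]
        self.misses += 1
        decoded = [decode(word) for word in memory[address]]
        self.packets[address] = decoded
        if len(self.packets) > self.capacity:
            self.packets.popitem(last=False)    # evict least recently used
        return decoded
```

With a two-packet loop executed ten times, a cache of capacity 2 misses twice and hits eighteen times, whereas a cache of capacity 1 misses on every fetch. This is the effect behind the remark that the simulation cache should be large enough to hold the critical loop code.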
The IFT is a table containing all the possible meanings (operand, operation code, destination, flag, etc.) of an instruction field in a VLIW instruction, independently of its position or size in the mask.

Branch Architecture: The description supplies the pipeline phases in which the branch condition is evaluated and the target address is computed.

Pipeline description: Number of pipeline phases and functions per phase (Fetch phases, Decode phases, Execute phases).

I/D Cache Parameterization: Cache Size, Line (Block) Size, number of ways (1 is direct mapping), Write on Fetch option, Block Replacement Algorithm.

VLIW Parameterization: Long Instruction Word Size, Register File size and organization (number of Register Banks and Bank size), Control Register File size and organization.

4.4 Instruction Set Dynamic Generation
Instruction Set flexibility is a major feature for Application Driven Architecture exploration and architectural design evaluation [5][16]. The Instruction Set Dynamic Generation tool allows a behavioral description of the Instruction Set to be supplied to the simulator. A language to describe a general VLIW instruction has been identified. For each instruction, the mnemonic name, instruction class, type of operand(s), destination, latency, operation code and the expression defining the instruction are supplied. If the instruction uses a control register, the description contains a field specifying the register (or the parts of the register) used. Other characteristics can be specified depending on the instruction class. Instructions are divided into three classes, each identifying a specific instruction type: memory operations, arithmetic-logical operations and branches. The description language allows the user to completely define the behavior of an instruction through the expression field. In the expression field, all the relations between operands can be specified in a C-like notation. The user-defined instruction set is taken as input by the Instruction Set Dynamic Generator (ISDG) parser. The parser analyzes the description and produces an intermediate representation of the instructions, which is used to produce the Instruction Set modules.

5 The Simulation Environment Performance on Two Case Studies
This chapter reports tests on two VLIW target architectures in order to validate the VLIW-SIM simulation environment. The platform used for the tests is a Pentium II 400 MHz with 128 MB RAM running Windows NT 4.0. Two different benchmarks have been used: an H.263+ [17] coder and a G.723.1 [18] [19] encoder and decoder, both implemented in C.

Tests on the H.263+ encoder have been performed with the following parameters:
- Test Sequence: foreman.yuv
- Number of encoded frames: 10 (from frame 0 (Intra) to frame 9)
- Video input format: QCIF (176 x 144 pixel)
- Quantization index for P-frame: 10
- Quantization index for I-frame: 10
- Motion estimation search algorithm: Improved Gradient Descent Search
- Motion estimation search window: 15
- Half Pixel Motion Estimation type: subset half pixels

Tests on the G.723.1 codec have been performed with the following parameters:
- Test Sequence: ITU standard
- Number of encoded and decoded frames: 20 (from frame 0 to frame 19)
- Audio input format: 8 kHz, 16 bit
- Output Rate: 5.3 Kbps

5.1 Texas Instruments TMS320C62x
In the following test the target architecture is the TI TMS320C62x [10]. VLIW-SIM performance and accuracy have been compared with TI's state-of-the-art simulator (Texas Instruments Fast Simulator, TIFS). The register files and the memory of the two simulators matched exactly at the end of the tests.

Table 1 - H.263+: encoding 10 QCIF frames
TIFS           ,630,083
VLIW-SIM       ,630,080

Table 2 - G.723.1 Coding-Decoding 20 frames
TIFS           ,516,855
VLIW-SIM   70  10,516,923

5.2 ST-Microelectronics ST200
In the following test the target architecture is the ST Microelectronics ST200 [12].
VLIW-SIM performance and accuracy have been compared with the state-of-the-art ISS simulator (HP-ST Lx Instruction Set Simulator). The register files and the memory of the two simulators matched exactly at the end of the tests.

Table 3 - H.263+ encoding 10 QCIF frames
ISS        95  30,627,631
VLIW-SIM   97  30,365,220

Table 4 - G.723.1 Coding-Decoding 20 frames
ISS        40  12,324,010
VLIW-SIM   42  12,311,062

5.3 VLIW-SIM performance
The following table summarizes the VLIW-SIM performance, in simulated instructions per second (sips), in the described case studies for the selected target architectures.

Table 5 - VLIW-SIM performance in simulated instructions per second
             VLIW-SIM (ST200)   VLIW-SIM (TI-C62xx)
G.723.1      ,752 sips          580,041 sips
H.263+       ,237 sips          328,731 sips

It should be noted that the ST200 ISS is optimised for a single architecture.

6 Conclusions
In this paper, instruction simulation issues for Hardware-Software Co-Design have been discussed. The VLIW-SIM simulation environment has been presented as a solution for VLIW architecture and multimedia application design support tools. Simulation Locality and Simulation-Oriented Hardware description were introduced to address flexibility, speed and accuracy in ISA simulation. Two case studies have been presented to validate the described approach.

Acknowledgments
This research is part of the M 2 EDYA project in collaboration with ST-Microelectronics and Hewlett Packard.

References
[1] Joseph A. Fisher, Very long instruction word architectures and the ELI-512, Proceedings of the 10th Annual International Symposium on Computer Architecture, Stockholm, Sweden, June
[2] V. Bhaskaran, K. Konstantinides, Image and Video Compression Standards: Algorithms and Architecture, Second Edition, Kluwer Academic Publishers, 1998
[3] B.R. Rau, J.A. Fisher, Instruction Level Parallelism, The Journal of Supercomputing 7, May 1993
[4] A. Hoffmann, T. Kogel, A. Nohl, G. Braun, O. Schliebusch, O. Wahlen, A. Wieferink, H. Meyr, A Novel Methodology for the Design of Application-Specific Instruction-Set Processors (ASIPs) Using a Machine Description Language, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, Vol. 20, N. 11, November 2001
[5] P. Faraboschi, G. Desoli, J.A. Fisher, The Latest Word in Digital and Media Processing, IEEE Signal Processing Magazine, March 1998
[6] R.B. Lee, M.D. Smith, Media Processing: A New Design Target, IEEE Micro, August 1996
[7] P. Lapsley, J. Bier, A. Shoham, E.A. Lee, DSP Processor Fundamentals: Architectures and Features, IEEE Press series on Signal Processing
[8] Berkeley Design Technology Inc, VLIW Architectures for DSP, DSP World/ICSPAT, Orlando, Florida, November 1999
[9] I. Barbieri, M. Bariani, M. Raggio, C6XSIM: A VLIW Architecture Simulation Innovative Approach, DCIS 99, Palma de Maiorca, Spain, November 1999
[10] TI, TMS320C62x/C67x CPU and Instruction Set, Reference Guide, 1998
[11] TI, TMS320C64x Technical Overview, September 2000
[12] P. Faraboschi, J. Fisher, G. Brown, G. Desoli, F. Homewood, Lx: A Technology Platform for Customizable VLIW Embedded Processing, ISCA, Vancouver, Canada, June
[13] K. Olukotun, M. Heinrich, D. Ofelt, Digital system simulation: Methodologies and examples, Proc. Design Automation Conf., June 1998, pp
[14] J. Rowson, Hardware/Software co-simulation, Proc. Design Automation Conference, June 1994, pp
[15] I. Barbieri, M. Bariani, M. Raggio, A VLIW architecture simulator innovative approach for HW-SW co-design, ICME 2000 - International Conference on Multimedia and Expo 2000, July 2000, New York City
[16] R.K. Gupta, G. De Micheli, Hardware-Software cosynthesis for digital systems, IEEE Design and Test of Computers, September 1993
[17] ITU-T Recommendation H.263, Video coding for low bitrate communication, Feb
[18] ITU-T Recommendation G.723.1, Dual rate speech coder for multimedia communication transmitting at 5.3 and 6.3 kbit/s, October 1995
[19] S. M. Mishra, A. Balaram, Efficient Hardware-Software Co-Design for the G.723.1 Algorithm Targeted At VoIP Applications, ICME, New York City, US, August 2000


More information

ADVANCED EMBEDDED MONITORING SYSTEM FOR ELECTROMAGNETIC RADIATION

ADVANCED EMBEDDED MONITORING SYSTEM FOR ELECTROMAGNETIC RADIATION 98 Chapter-5 ADVANCED EMBEDDED MONITORING SYSTEM FOR ELECTROMAGNETIC RADIATION 99 CHAPTER-5 Chapter 5: ADVANCED EMBEDDED MONITORING SYSTEM FOR ELECTROMAGNETIC RADIATION S.No Name of the Sub-Title Page

More information

EDA for IC System Design, Verification, and Testing

EDA for IC System Design, Verification, and Testing EDA for IC System Design, Verification, and Testing Edited by Louis Scheffer Cadence Design Systems San Jose, California, U.S.A. Luciano Lavagno Cadence Berkeley Laboratories Berkeley, California, U.S.A.

More information

FPGA Based 70MHz Digital Receiver for RADAR Applications

FPGA Based 70MHz Digital Receiver for RADAR Applications Technology Volume 1, Issue 1, July-September, 2013, pp. 01-07, IASTER 2013 www.iaster.com, Online: 2347-6109, Print: 2348-0017 FPGA Based 70MHz Digital Receiver for RADAR Applications ABSTRACT Dr. M. Kamaraju

More information

Video Encoder Optimization for Efficient Video Analysis in Resource-limited Systems

Video Encoder Optimization for Efficient Video Analysis in Resource-limited Systems Video Encoder Optimization for Efficient Video Analysis in Resource-limited Systems R.M.T.P. Rajakaruna, W.A.C. Fernando, Member, IEEE and J. Calic, Member, IEEE, Abstract Performance of real-time video

More information

AI Application Processing Requirements

AI Application Processing Requirements AI Application Processing Requirements 1 Low Medium High Sensor analysis Activity Recognition (motion sensors) Stress Analysis or Attention Analysis Audio & sound Speech Recognition Object detection Computer

More information
