![]() |
Research Work |
![]() |
| VSSAD, Intel Massachusetts Inc., Hudson, MA |
|---|
|
|||||||||||||||||||||||||||||||||||||||||||
Current Status: |
Aamer Jaleel currently works with an advanced development group called VSSAD in Intel Massachusetts, Inc. Aamer's current research work focuses on memory system optimizations for CMPs and workload characterization. | ||||||||||||||||||||||||||||||||||||||||||
Research Interests: |
|||||||||||||||||||||||||||||||||||||||||||
Publications: |
|
||||||||||||||||||||||||||||||||||||||||||
Honors/Awards |
|||||||||||||||||||||||||||||||||||||||||||
Technical Reports: |
[UMD-SCA 2006] "CMP$im: A Binary Instrumentation Approach to Modeling Memory Behavior of Workloads on CMPs", Aamer Jaleel, Robert S. Cohn, Chi-Keung Luk, and Bruce Jacob. Technical Report - UMD-SCA-2006-01 [UMD-SCA 2005] "Disorder - The Reordering of Memory Instructions In Out-of-Order Systems", Aamer Jaleel and Bruce Jacob. Technical Report - UMD-SCA-2005-01 |
||||||||||||||||||||||||||||||||||||||||||
Theses/Proposals |
[Master's Thesis] "In-Line Interrupt Handling and Lock-Up Free TLBs", Aamer Jaleel. Master's Thesis, University of Maryland, College Park. April 2002. [Ph.D. Research Proposal] "The Effects of OoO Execution on the Memory System", Aamer Jaleel. Ph.D. Research Proposal, University of Maryland, College Park. November 2004.(Proposal Exam Slides) |
||||||||||||||||||||||||||||||||||||||||||
Non-Conference Talks: |
"Characterizing The LLC Cache Behavior of OpenMP Bioinformatic Workloads". Aamer Jaleel, May 2005, Intel Massachusetts Inc. "Design Trade-Offs of On-Chip Routers in High Performance Microprocessors". Aamer Jaleel, January 17th 2002, Intel Santa Clara. "The Impact of Out-of-Order Execution on Memory System Behavior", Summer Intern Presentation, Compaq Western Research Lab (WRL), August 29, 2001. | ||||||||||||||||||||||||||||||||||||||||||
Industry Internship Experience: |
My experiences as an intern in industry. |
||||||||||||||||||||||||||||||||||||||||||
Grad School Projects: |
Fully Buffered Dimms (FBDs): Exploring the different design parameters of implementing Fully Buffered Dimms. Aided in the process of implementing a performance model for fully buffered dimms based on information presented at Denali MemCon. Embedded Systems: Investigating hardware-based methods to improve the performance of embedded operating systems. Reverse engineered Texas Instruments embedded micro-controller (MSP430) in Verilog. The Verilog RTL supports interrupts, serial I/O, watch dog timer, and is capable of running an embedded OS (e.g. uCOS) and simple embedded applications. The RTL is expected to be fabricated on 0.25 .m technology through MOSIS. Expected PTQ is August 2004. My contributions to the Verilog RTL include the implementation, verification, and synthesis of the following modules: Serial I/O, SRAM, Register File, Watch Dog Timer, and the Control Logic for the CPU core. Page Based Commands for DRAM Systems: Investigated the need for replacing frequent, sequential reads and writes to memory with page based commands. Hacked and recompiled Linux Kernel 2.4.19 to gather statistics on frequency and parameter sizes of calls to functions memcpy and memset (in both user and kernel domains). Proposed the addition of simple new commands to be issued by the processor to the memory controller to implement page based commands. Discussed the possible improvements in latency as well as bandwidth on several representative SPEC2000 benchmarks. Impact of Aggressive Out-Of-Order Techniques on Memory Systems: Studied the impact of aggressive out of order techniques (increase in issue widths and reorder buffer sizes) on the performance of a system for several representative SPEC2000 benchmarks. This study contributed to a conference paper (HPCA'05). Lock-Up Free TLBs: Designed and simulated a novel method to speed up the processing of software managed interrupts, TLB interrupts in particular. This study was part of my Masters Thesis and contributed to two conference papers (ICCD'01, HiPC'01) and a journal paper in IEEE Transactions on Computers. Virtual Memory: Simulated the page table mechanisms of different architectures including Mach, Intel, PA-RISC, PowerPC, UltraSparc, PUMA, Winchip, and Ultrix. Demonstrated the characteristics about these mechanisms: whether they can guarantee performance or not, and whether they can complete in a reasonable amount of time or not. This study contributed to a journal paper in progress. Operating Systems: Designed a software environment to simulate a 16-bit pipelined architecture, with a simple operating system that ran atop the simulated architecture. Simulations were written in C; the operating system in the assembly code (DLX) of the simulated architecture. The full-system emulation will be used to teach future sections of ENEE 350: Computer Organization. |
||||||||||||||||||||||||||||||||||||||||||
Education: |
|||||||||||||||||||||||||||||||||||||||||||
Ph.D. Advisor: |
Dr. Bruce L. Jacob (blj at umd dot edu) www: http://www.ece.umd.edu/~blj/ |
||||||||||||||||||||||||||||||||||||||||||
Class Projects |
|||||||||||||||||||||||||||||||||||||||||||