Browsing LCS Publications by Author "Agarwal, Anant"
Now showing items 1-16 of 16
-
Addressing Partitioned Arrays in Distributed Memory Multiprocessors - the Software Virtual Memory Approach
Barua, Rajeev; Kranz, David; Agarwal, Anant (1994-12)Harnessing the full performance potential of cache-coherent distributed shared memory multiprocessors without inordinate user effort requires a compilation technology that can automatically manage multiple levels of memory ... -
Anatomy of a Message in the Alewife Multiprocessor
Kubiatowicz, John; Agarwal, Anant (1993-02) -
Baring it all to Software: The Raw Machine
Waingold, Elliot; Taylor, Michael; Sarkar, Vivek; Lee, Walter; Lee, Victor; e.a. (1997-03)Rapid advances in technology force a quest for computer architectures that exploit new opportunities and shed existing mechanisms that do not scale. Current architectures, such as hardware scheduled superscalars, are ... -
Closing the Window of Vulnerability in Multiphase Memory Transcations
Kubiatowicz, John; Chaiken, David; Agarwal, Anant (1992-06)Multiprocessor architects have begun to explore several mechanisms such as prefetching, context-switching and software-assisted dynamic cache-coherence, which transform single-phase memory transactions in conventional ... -
Compile-time Techniques for Processor Allocation in Macro Dataflow Graphs for Multiprocessors
Prasanna, G.N. Srinivasa; Agarwal, Anant (1992-06)When compiling a progam consisting of multiple nested loops for execution on a multiprocessor, processor allocation is the problem of determining the number of processors over which to partition each nested loop. This paper ... -
Experience with Fine-grain Synchronization in MIMD Machines for Preconditioned Conjugate Gradient
Yeung, Donald; Agarwal, Anant (1992-10)This paper discusses our experience with fine-grain synchronization for the preconditioned conjugate gradient method using the modified incomplete Cholesky factorization of the coefficient matrix as a preconditioner. This ... -
How to Build Scalable On-Chip ILP Networks for a Decentralized Architecture
Taylor, Michael Bedford; Lee, Walter; Frank, Matthew; Amarasinghe, Saman; Agarwal, Anant (2000-04)The era of billion transistors-on-a-chip is creating a completely different set of design constraints, forcing radically new microprocessor archiecture designs. This paper examines a few of the possible microarchitectures ... -
Limitless Directories: A Scalable Cache Coherence Scheme
Chaiken, David; Kubiatowicz, John; Agarwal, Anant (1991-06)Caches enhance the performance of multiprocessors by reducing network traffic and average memory access latency. However, cache-based systems must address the problem of cache coherence. We propose the LimitLESS directory ... -
Memory Assignment for Multiprocessor Caches Through Graph Coloring
Agarwal, Anant; Guttag, John; Papaefthymiou, Marios (1992-02)It has become apparent that the achieved performance of multiprocessors is heavily dependent upon the quality of the availabel compilers. In this paper we are concerned with compile-time techniques that can be used to ... -
The MIT Alewife Machine: A Large-scale Distributed-memory Multiprocessor
Agarwal, Anant; Chaiken, David; Johnson, Kirk; Kranz, David; Kubiatowicz, John; e.a. (1991-06)The Alewife multiprocessor project focuses on the architecture and design of a large-scale parallel machine. The machine uses a low dimension direct interconnection network to provide scalable communication band-width, ... -
Modeling Multiprogrammed Caches
Agarwal, AnantThis paper presents a simple, yet accurate, model for multiprogrammed caches and validates it against trace-driven simulation. The model takes into account nonstationary behavior of processes and process sharing. By making ... -
Scalar Operand Networks: On-chip interconnect for ILP in Partitioned Architechures
Taylor, Michael Bedford; Lee, Walter; Amarasinghe, Saman; Agarwal, Anant (2002-07) -
Software-Extended Coherent Shared Memory: Performance and Cost
Chaiken, David; Agarwal, Anant (1993-10)This paper evaluates the tradeoffs involved when designing a directory-based protocol that implements coherent shared memory through a combination of hardware and software mechanisms. The fundamental design decisions involve ... -
Space - Time Scheduling of Instruction-Level Parallelism on a Raw Machine
Lee, Walter; Barua, R.; Srikrishna, D.; Babb, Jonathan; Sarkar, V.; e.a. (1997-12)Advances in VLSI technology will enable chips with over a billion transistors within the next decade. Unfortunately, the centralized-resource architectures of modern microprocessors are ill-suited to exploit such advances. ... -
UDM: User Direct Messaging for General-Purpose Multiprocessing
Mackenzie, Kenneth; Kubiatowicz, John; Frank, Matthew; Lee, Walter; Victor, Lee; e.a. (1996-03)User Direct Messaging (UDM) allows user-level, processor-to- processor messaging to coexist with general multiprogramming and virtual memory. Direct messaging, where processors launch and receive messages in tens of cycles ... -
Waiting Algorithms for Synchornization in Large-scale Multiprocessors
Lim, Beng-Hong; Agarwal, Anant (1991-02)Through analysis and experiments, this paper investigates two-phase waiting algorithms to minimize the cost of waiting for synchronization in large-scale multiprocessors. In a two-phase algorithm, a thread first waits by ...