主页
外文期刊
OA 期刊
电子期刊
外文会议
中文期刊
标准
网络数据库
专业机构
高级检索
关于我们
版权声明
使用帮助
会议文集
会议名
31st IEEE International Symposium on High Performance Computer Architecture (HPCA 2025)
中译名
《第三十一届IEEE国际高性能计算机体系架构研讨会,卷3》
机构
Institute of Electrical and Electronic Engineers (IEEE)
会议日期
1-5 March 2025
会议地点
Las Vegas, Nevada, USA
出版年
2025
馆藏号
357272
题名
作者
出版年
Hydra: Scale-out FHE Accelerator Architecture for Secure Deep Learning on FPGA
Yinghao Yang; Xicheng Xu; Haibin Zhang; Jie Song; Xin Tang; Hang Lu; Xiaowei Li
2025
WarpDrive: GPU-Based Fully Homomorphic Encryption Acceleration Leveraging Tensor and CUDA Cores
Guang Fan; Mingzhe Zhang; Fangyu Zheng; Shengyu Fan; Tian Zhou; Xianglong Deng; Wenxu Tang; Liang Kong; Yixuan Song; Shoumeng Yan
2025
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from μWatts to MWatts for Sustainable AI
Arya Tschand; Arun Tejusve Raghunath Rajan; Sachin Idgunji; Anirban Ghosh; Jeremy Holleman; Csaba Kiraly; Pawan Ambalkar; Ritika Borkar; Ramesh Chukka; Trevor Cockrell; Oliver Curtis; Grigori Fursin; Miro Hodak; Hiwot Kassa; Anton Lokhmotov; Dejan Miskovic; Yuechao Pan; Manu Prasad Manmathan; Liz Raymond; Tom St. John; Arjun Suresh; Rowan Taubitz; Sean Zhan; Scott Wasson; David Kanter; Vijay Janapa Reddi
2025
Enterprise Class Modular Cache Hierarchy
Craig Walters; Deanna Berger; Robert Sonnelitter; Alper Buyuktosunoglu
2025
Predicting DRAM-Caused Risky VMs in Large-Scale Clouds
Yaoguang Yong; Xiaoming Du; Xuhua Ma; Yuxiang Wang; Bin Yao; Xudong Zheng; Huite Yi
2025
Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization
Jianbo Dong; Bin Luo; Jun Zhang; Pengcheng Zhang; Fei Feng; Yikai Zhu; Ang Liu; Zian Chen; Yi Shi; Hairong Jiao; Gang Lu; Yu Guan; Ennan Zhai; Wencong Xiao; Hanyu Zhao; Man Yuan; Siran Yang; Xiang Li; Jiamang Wang; Rui Men; Jianwei Zhang; Chang Zhou; Dennis Cai; Yuan Xie; Binzhang Fu
2025
Revisiting Reliability in Large-Scale Machine Learning Research Clusters
Apostolos Kokolis; Michael Kuchnik; John Hoffman; Adithya Kumar; Parth Malani; Faye Ma; Zachary DeVito; Shubho Sengupta; Kalyan Saladi; Carole-Jean Wu
2025
HILP: Accounting for Workload-Level Parallelism in System-on-Chip Design Space Exploration
Joseph Rogers; Lieven Eeckhout; Magnus Jahre
2025
CORDOBA: Carbon-Efficient Optimization Framework for Computing Systems
Mariam Elgamal; Doug Carmean; Elnaz Ansari; Okay Zed; Ramesh Peri; Srilatha Manne; Udit Gupta; Gu-Yeon Wei; David Brooks; Gage Hills; Carole-Jean Wu
2025
Architecting Space Microdatacenters: A System-level Approach
Nathan Bleier; Rick Eason; Michael Lembeck; Rakesh Kumar
2025
ARTEMIS: Agile Discovery of Efficient Real-Time Systems-on-Chips in the Heterogeneous Era
Subhankar Pal; Aporva Amarnath; Behzad Boroujerdian; Augusto Vega; Alper Buyuktosunoglu; John-David Wellman; Vijay Janapa Reddi; Pradip Bose
2025
LEGO: Spatial Accelerator Generation and Optimization for Tensor Applications
Yujun Lin; Zhekai Zhang; Song Han
2025
DynamoLLM: Designing LLM Inference Clusters for Performance and Energy Efficiency
Jovan Stojkovic; Chaojie Zhang; Inigo Goiri; Josep Torrellas; Esha Choukse
2025
throttLL'eM: Predictive GPU Throttling for Energy Efficient LLM Inference Serving
Andreas Kosmas Kakolyris; Dimosthenis Masouros; Petros Vavaroutsos; Sotirios Xydis; Dimitrios Soudris
2025
RpcNIC: Enabling Efficient Datacenter RPC Offloading on PCIe-attached SmartNICs
Jie Zhang; Hongjing Huang; Xuzheng Chen; Xiang Li; Jieru Zhao; Ming Liu; Zeke Wang
2025
NVMePass: A Lightweight, High-performance and Scalable NVMe Virtualization Architecture with I/O Queues Passthrough
Yiquan Chen; Zhen Jin; Yijing Wang; Yi Chen; Jiexiong Xu; Hao Yu; Jinlong Chen; Wenhai Lin; Kanghua Fang; Keyao Zhang; Chengkun Wei; Qiang Liu; Yuan Xie; Wenzhi Chen
2025
Warped-Compaction: Maximizing GPU Register File Bandwidth Utilization via Operand Compaction
Eunbi Jeong; Ipoom Jeong; Myung Kuk Yoon; Nam Sung Kim
2025
Cooperative Warp Execution in Tensor Core for RISC-V GPGPU
Abubakr Nada; Giuseppe Maria Sarda; Erwan Lenormand
2025
SparseWeaver: Converting Sparse Operations as Dense Operations on GPUs for Graph Workloads
Shinnung Jeong; Liam Paul Cooper; Ju Min Lee; Heelim Choi; Nicholas Parnenzini; Chihyo Ahn; Yongwoo Lee; Hanjun Kim; Hyesoon Kim
2025
HSMU-SpGEMM: Achieving High Shared Memory Utilization for Parallel Sparse General Matrix-Matrix Multiplication on Modern GPUs
Min Wu; Huizhang Luo; Fenfang Li; Yiran Zhang; Zhuo Tang; Kenli Li; Jeff Zhang; Chubo Liu
2025
1
2
国家科技图书文献中心
全球文献资源网
京ICP备05055788号-26
京公网安备11010202008970号 机械工业信息研究院 2018-2025