Day 1 (10/1, Sat)
AutoDL
https://sites.google.com/rice.edu/auto-dl/
- Developing DNN models is hard
- Developing DNN hardware accelerators is also hard
- Both are time- and labor-intensive
- Even if you do both, combining the two does not guarantee optimal performance
- In terms of both efficiency & accuracy
→ Hardware-algorithm co-design is needed
- keyword: NAS (neural architecture search)
- HW-NAS (HW-aware NAS)
- Three related works were presented
- Auto-NBA
http://proceedings.mlr.press/v139/fu21d/fu21d.pdf
- Input: user requirements on accuracy & efficiency
- Output: a matched network, bitwidth, and accelerator
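The idea of pairing a network, a bitwidth, and an accelerator against a user's accuracy/efficiency target can be sketched as a scored search over joint candidates. This is a toy illustration under my own assumptions (all names, metrics, and the penalty scheme are hypothetical, not Auto-NBA's actual formulation, which uses differentiable search):

```python
# Hypothetical joint candidates: each pairs a network, a quantization
# bitwidth, and an accelerator config (PE count), with assumed metrics.
CANDIDATES = [
    {"net": "mbv2",  "bits": 8, "accel_pe": 64,  "accuracy": 0.72, "latency_ms": 12.0},
    {"net": "mbv2",  "bits": 4, "accel_pe": 64,  "accuracy": 0.69, "latency_ms": 6.5},
    {"net": "res18", "bits": 8, "accel_pe": 128, "accuracy": 0.74, "latency_ms": 25.0},
]

def score(c, latency_budget_ms=15.0, penalty=0.01):
    """Accuracy minus a penalty for exceeding the user's latency budget."""
    overshoot = max(0.0, c["latency_ms"] - latency_budget_ms)
    return c["accuracy"] - penalty * overshoot

def best_pair(candidates):
    """Exhaustive search over the toy space; a real co-search is far larger."""
    return max(candidates, key=score)

# Under a 15 ms budget, the 8-bit mbv2 pairing wins: res18 is more
# accurate but blows the budget, and 4-bit mbv2 loses too much accuracy.
best = best_pair(CANDIDATES)
```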
- HW-NAS-Bench
https://openreview.net/pdf?id=_0kaDkv3dVf
- Searches for optimal DNN architectures for fixed, non-customizable target devices
- (maybe a hint for model conversion targeting a specific AI chip?)
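Since the devices are non-customizable, the benchmark boils down to a lookup table of pre-measured per-device costs, and search becomes filtering that table. A minimal sketch in that spirit (the table entries and field names here are made up, not HW-NAS-Bench's real API or data):

```python
# Hypothetical benchmark table: pre-measured accuracy and on-device
# latency for each candidate architecture on one fixed edge device.
BENCH = [
    {"arch": "arch0", "accuracy": 0.71, "edgegpu_latency_ms": 5.2},
    {"arch": "arch1", "accuracy": 0.74, "edgegpu_latency_ms": 7.9},
    {"arch": "arch2", "accuracy": 0.73, "edgegpu_latency_ms": 11.4},
]

def best_under_budget(table, budget_ms):
    """Highest-accuracy architecture whose measured latency fits the budget."""
    feasible = [e for e in table if e["edgegpu_latency_ms"] <= budget_ms]
    return max(feasible, key=lambda e: e["accuracy"])["arch"] if feasible else None
```

Because every metric is a table lookup rather than a training run, this kind of search is cheap enough to repeat per target device.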
- DNN-Chip Predictor
https://ieeexplore.ieee.org/document/9053977
FireSim & Chipyard
https://fires.im/micro-2022-tutorial/
Day 2 (10/2, Sun)
FastPath
https://fastpathconference.github.io/FastPath2022/
International Workshop on Performance Analysis of Machine Learning Systems
- Ingesting and Processing Data Efficiently for Machine Learning
- Machine Learning for Better Medicine
- Canceled
- System Requirements for Deep Learning Foundational Models
- Using Processing-in-Memory to Accelerate Edge Machine Learning
- AI4Physics: From Conceptualization to AI-Driven Discovery at Scale
- SODA: An End-To-End Open-Source Hardware Compiler for Machine Learning Accelerators
- Faster Learning on Slow Hardware
Day 3 (10/3, Mon)
Session 4B: Machine Learning
- Skipper: Enabling Efficient SNN Training Through Activation-Checkpointing and Time-Skipping
- Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tiles
- Adaptable Butterfly Accelerator for Attention-Based NNs via Hardware and Algorithm Co-Design
- DFX: A Low-Latency Multi-FPGA Appliance for Accelerating Transformer-Based Text Generation
- HARMONY: Heterogeneity-Aware Hierarchical Management for Federated Learning System
Day 4 (10/4, Tue)
Keynote
Democratizing Customized Computing
- How can programmers without circuit-design knowledge use FPGAs easily?
- AutoSA
- AutoDSE
- GNN-DSE
- HeteroCL
- But compile times are too long…
- TAPA
- MLIR? → seems worth looking into
Session 5B: Accelerators In/Near Memory
- GenPIP: In-Memory Acceleration of Genome Analysis by Tight Integration of Basecalling and Read Mapping
- BEACON: Scalable Near-Data-Processing Accelerators for Genome Analysis near Memory Pool with the CXL Support
- Sparse Attention Acceleration with Synergistic In-Memory Pruning and On-Chip Recomputation
- ICE: An Intelligent Cognition Engine with 3D NAND-based In-Memory Computing for Vector Similarity Search Acceleration
Day 5 (10/5, Wed)
Session 9A: Design Methodology
- RemembERR: Leveraging Microprocessor Errata for Improving Design Testing and Validation
- Datamime: Generating Representative Benchmarks by Automatically Synthesizing Datasets
- An Architecture Interface and Offload Model for Low-Overhead, Near-Data, Distributed Accelerators
- Towards Developing High Performance RISC-V Processors Using Agile Methodology
Session 10B: Machine Learning
- 3D-FPIM: An Extreme Energy-Efficient DNN Acceleration System Using 3D NAND Flash-Based In-Situ PIM Unit
- Sparseloop: An Analytical Approach to Sparse Tensor Accelerator Modeling
- DeepBurning-SEG: Generating DNN Accelerators of Segment-Grained Pipeline Architecture
- ANT: Exploiting Adaptive Numerical Data Type for Low-Bit Deep Neural Network Quantization
- Ristretto: An Atomized Processing Architecture for Sparsity-Condensed Stream Flow in CNN
Korean server developers