(2024)
BiRD: Bi-directional Input Reuse Dataflow for Enhancing Depthwise Convolution Performance on Systolic Arrays.
IEEE TRANSACTIONS ON COMPUTERS.
73,
12
(2023)
FlexKA: A Flexible Karatsuba Multiplier Hardware Architecture for Variable-Sized Large Integers.
IEEE ACCESS.
11,
(2022)
FARNN: FPGA-GPU Hybrid AccelerationPlatform for Recurrent Neural Networks.
IEEE Transactions on Parallel and Distributed Systems.
33,
7
(2021)
SpecMCTS: Accelerating Monte Carlo Tree Search Using Speculative Tree Traversal.
IEEE ACCESS.
9,
1
(2021)
RiSA: A Reinforced Systolic Array for Depthwise Convolutions and Embedded Tensor Reshaping.
ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS.
20,
5
학술회의논문
(2025)
GraphAccel: An In-Storage Accelerator for Efficient Graph-Based Vector Similarity Search Using Page Packing and Speculative Search Optimization.
Design Automation Conference.
미국
(2025)
SPARQ: An Accelerator Architecture for Large Language Models with Joint Sparsity and Quantization Techniques.
ACM SIGPLAN/SIGBED Conference on Languages, Compilers and Tools for Embedded Systems.
대한민국
(2025)
Detecting Cache-based Side-Channel Attacks by Leveraging Mesh Interconnect Traffic Monitoring.
ACM SIGAPP Symposium on Applied Computing.
이탈리아
(2022)
ES4D: Accelerating Exact Similarity Search for High-Dimensional Vectors via Vector Slicing and In-SSD Computation.
IEEE International Conference on Computer Design.
미국
(2022)
Know Your Neighbor: Physically Locating Xeon Processor Cores on the Core Tile Grid.
Design Automation and Test in Europe Conference.
벨기에