Research on the High-Performance Implementation Mechanism of GotoBLAS General Matrix-Matrix Multiplication
Authors
宋刚
张云泉
李玉成
蒋孟奇
Publication date
1 January 2008
Abstract
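This paper analyzes the implementation mechanism of the GOTOBLAS library (GOTO), in particular its implementation of general matrix multiplication. Drawing on recent research results, it discusses how to implement matrix multiplication efficiently and raises the effect of the memory hierarchy on program performance to the level of the computational model. Comparative experiments show that the GOTO library performs far better than general BLAS libraries that do not take the memory hierarchy into account. This demonstrates both the performance advantage of the GOTO library and the need to incorporate the memory hierarchy into the computational model.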
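To make the memory-hierarchy argument concrete, the sketch below contrasts a straightforward triple-loop matrix multiplication with a cache-blocked variant. It is an illustration only and not the GOTO library's actual code, which is not reproduced in this record. The function names `naive_gemm` and `blocked_gemm` and the block sizes `mc`, `kc`, and `nc` are hypothetical choices made for this example. Matrices are stored as flat row-major lists.

```python
def naive_gemm(a, b, m, n, k):
    """Return C = A * B, where A is m x k and B is k x n (flat, row-major)."""
    c = [0.0] * (m * n)
    for i in range(m):
        for j in range(n):
            acc = 0.0
            # Walks down a column of B: stride-n access, poor cache reuse.
            for p in range(k):
                acc += a[i * k + p] * b[p * n + j]
            c[i * n + j] = acc
    return c


def blocked_gemm(a, b, m, n, k, mc=32, kc=32, nc=32):
    """Same product, computed tile by tile.

    mc, kc, nc are illustrative block sizes (assumptions, not tuned values).
    Each (mc x kc) tile of A and (kc x nc) tile of B is reused many times
    while it would still be resident in a fast level of the memory hierarchy.
    """
    c = [0.0] * (m * n)
    for pp in range(0, k, kc):
        p_end = min(pp + kc, k)
        for ii in range(0, m, mc):
            i_end = min(ii + mc, m)
            for jj in range(0, n, nc):
                j_end = min(jj + nc, n)
                # Multiply one tile of A by one tile of B, accumulating into C.
                for i in range(ii, i_end):
                    for p in range(pp, p_end):
                        a_ip = a[i * k + p]
                        # Innermost loop runs along rows of B and C (unit stride).
                        for j in range(jj, j_end):
                            c[i * n + j] += a_ip * b[p * n + j]
    return c
```

Both functions compute the same result; only the order of memory accesses differs. In pure Python the interpreter overhead hides any cache effect, so this shows the loop structure rather than the speedup. The gains reported in the abstract would come from compiled kernels, where tile sizes are matched to the cache levels of the target machine.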
Available Versions
Institute Of Software, Chinese Academy Of Sciences
oai:ir.iscas.ac.cn:311060/4298
Last time updated on 30/12/2017