2 research outputs found
The PMS project: Poor Man's Supercomputer
We briefly describe the Poor Man's Supercomputer (PMS) project carried out at
Eotvos University, Budapest. The goal was to develop a cost effective,
scalable, fast parallel computer to perform numerical calculations of physical
problems that can be implemented on a lattice with nearest neighbour
interactions. To this end we developed the PMS architecture using PC components
and designed a special, low cost communication hardware and the driver software
for Linux OS. Our first implementation of PMS includes 32 nodes (PMS1). The
performance of PMS1 was tested by Lattice Gauge Theory simulations. Using SU(3)
pure gauge theory or bosonic MSSM on PMS1 we obtained 3Mflop
price-to-sustained performance for double and single precision operations,
respectively. The design of the special hardware and the communication driver
are freely available upon request for non-profit organizations.Comment: Latex, 13 pages, 6 figures included, minor additions, typos correcte
Better than $1/Mflops sustained: a scalable PC-based parallel computer for lattice QCD
We study the feasibility of a PC-based parallel computer for medium to large
scale lattice QCD simulations. The E\"otv\"os Univ., Inst. Theor. Phys. cluster
consists of 137 Intel P4-1.7GHz nodes with 512 MB RDRAM. The 32-bit, single
precision sustained performance for dynamical QCD without communication is 1510
Mflops/node with Wilson and 970 Mflops/node with staggered fermions. This gives
a total performance of 208 Gflops for Wilson and 133 Gflops for staggered QCD,
respectively (for 64-bit applications the performance is approximately halved).
The novel feature of our system is its communication architecture. In order to
have a scalable, cost-effective machine we use Gigabit Ethernet cards for
nearest-neighbor communications in a two-dimensional mesh. This type of
communication is cost effective (only 30% of the hardware costs is spent on the
communication). According to our benchmark measurements this type of
communication results in around 40% communication time fraction for lattices
upto 48^3\cdot96 in full QCD simulations. The price/sustained-performance ratio
for full QCD is better than 1.5/Mflops for
staggered) quarks for practically any lattice size, which can fit in our
parallel computer. The communication software is freely available upon request
for non-profit organizations.Comment: 14 pages, 3 figures, final version to appear in Comp.Phys.Com