Call for Paper - August 2022 Edition
IJCA solicits original research papers for the August 2022 Edition. Last date of manuscript submission is July 20, 2022. Read More

Evolution of GPUs, moving towards GPGPUs: A Survey

IJCA Proceedings on Technical Symposium on Emerging Technologies in Computer Science
© 2016 by IJCA Journal
TSETCS 2016 - Number 2
Year of Publication: 2016
Shivam Bharadwaj
Tejas Upmanyu
Sandeep Saxena

Shivam Bharadwaj, Tejas Upmanyu and Sandeep Saxena. Article: Evolution of GPUs, moving towards GPGPUs: A Survey. IJCA Proceedings on Technical Symposium on Emerging Technologies in Computer Science TSETCS 2016(2):20-27, June 2016. Full text available. BibTeX

	author = {Shivam Bharadwaj and Tejas Upmanyu and Sandeep Saxena},
	title = {Article: Evolution of GPUs, moving towards GPGPUs: A Survey},
	journal = {IJCA Proceedings on Technical Symposium on Emerging Technologies in Computer Science},
	year = {2016},
	volume = {TSETCS 2016},
	number = {2},
	pages = {20-27},
	month = {June},
	note = {Full text available}


Graphics Processing Units (GPUs) broke out by the end of 1990s, devoted to the goal of providing ubiquitous interactive 3D graphics which, a few years back then, was a far-fetched dream. By the end of decade, the technology grew exponentially, with nearly every computer containing a GPU, providing a high performance, visually rich, brilliant 3D computer graphics. This unprecedented growth was a cumulative outcome of rising demand for high-quality games, manufacturing processes advancements, and employment of inherent parallelism for computation. Today, the raw computational power of a GPU dwarfs that of the most powerful CPU, and the gap is continuously widening. World's most powerful supercomputers (e. g. , Tianhe-2(China) and Titan(USA)) use GPUs at their core [1]. This paper provides a review of GPU technology, overview of general purpose application of GPUs, architectural highlights, Enhancement of GPGPUs. In the end, we take a look at future research directions and challenges to parallel computing chips.


  • URL: http://www. top500. org/lists/2015/06/.
  • Gordon E. Moore, the co-founder and chairman emeritus of Intel and Fairchild Semiconductor.
  • URL: http://www. nvidia. com/object/tesla-servers. html.
  • NVIDIA Corporation. CUDA Toolkit 4. 0. http://developer. nvidia. com/ category/zone/cuda- zone.
  • AMD Close to the Metal (CTM). http://www. amd. com/.
  • The AMD Fusion Family of APUs. http://sites. amd. com/us/fusion/apu/ Pages/fusion. aspx.
  • URL: https://en. wikipedia. org/wiki/Sandy_Bridge.
  • URL: http://www. androidauthority. com/arm-mali-closer-look-605021/.
  • Khronos Group. OpenCL - the open standard for parallel programming on heterogeneous systems. http://www. khronos. org/opencl/
  • URL: http://allegroviva. com/gpu-computing/difference-between-gpu-and-cpu/#prettyPhoto
  • URL:http://www. nvidia. com/object/what-is-gpu-computing. html
  • 'An Introduction to Modern GPU architecture' by Ashu Rege, Director of Developer Technology, NVIDIA
  • Preliminary views on GPU technology by Burton Smith, Technical Fellow, Microsoft Formerly, Chief Scientist at Cray
  • "The Architecture and Evolution of CPU-GPU Systems for General Purpose Computing," Manish Arora, University of California, San Diego.
  • URL:http://www. nvidia. in/object/gpu-computing-in. html
  • E. Lindholm et al. NVIDIA Tesla: A unified graphics and computing ?architecture. IEEE Micro, 2008.
  • C. M. Wittenbrink et al. Fermi GF100 gpu architecture. IEEE Micro, ?2011.
  • NVIDIA's next generation cuda compute architecture: Kepler GK110. Technical report, 2012. ?
  • URL: http://devblogs. nvidia. com/parallelforall/maxwell-most-advanced-cuda-gpu-ever-made/
  • URL:http://www. karlrupp. net/2013/06/cpu-gpu-and-mic-hardware-characteristics-over-time/
  • URL:http://www. pcgamer. com/hardware-report-card-nvidia-vs-amd/
  • J. T. Adriaens et al. The case for gpgpu spatial multitasking. High Performance Computer Architecture, 2012.
  • J. Nickolls et al. The GPU computing era. IEEE Micro, 2010.
  • V. W. Lee et al. Debunking the 100x GPU vs. CPU myth: An evaluation of throughput computing on CPU and GPU. In International Symposium on Computer Architecture, 2010
  • Steven Muchnick. Advanced Compiler Design and Imple- mentation. Morgan Kaufmann, 1997
  • W. W. Fung, I. Sham, G. Yuan, and T. M. Aamodt. Dynamic Warp Formation and Scheduling for Efficient GPU Control Flow. In 40th International Symposium on Microarchitec- ture (MICRO-40), December 2007.
  • Collange, Sylvain. Stack-less SIMT Reconvergence at Low Cost, 2011. ?
  • Minsoo Rhu, Mattan Eraz, The Dual-Path Execution Model for Efficient GPU Control Flow, 2013.
  • J. Meng, D. Tarjan, and K. Skadron. Dynamic Warp Subdivision for Integrated Branch and Memory Divergence Tolerance. In 37th International Symposium on Computer Architecture (ISCA-37), 2010.
  • V. Narasiman et al. Improving GPU performance via large warps and two-level warp scheduling. In International Symposium on Microarchitecture, 2011.
  • S. W Keckler et al. GPUs and The Future of Parallel Computing. IEEE Micro,2011.
  • URL: http://www. extremetech. com/gaming/201417-nvidias-2016-roadmap-shows-huge-performance-gains-from-upcoming-pascal-architecture.