USING A GPU FOR REAL-TIME DIGITAL SIGNAL PROCESSING
Abstract
This paper is devoted to the development of energy-efficient implementations of digital signal processing algorithms in MIMO radar for estimating target parameters on computers with different architectures. In accordance with the global trend, the possibility of using computers with parallel architecture for digital processing of broadband radar signals is being considered. The authors proposed an implementation of the procedure for processing the reflected signal of MIMO radar using the technology of general computing on graphics cards (GPGPU). The performance of the developed solution was assessed on various GPUs with different microarchitectures. A criterion for evaluating the performance of a processing algorithm is proposed in the form of the ratio of the algorithm's throughput to the peak throughput of the computer's memory. A numerical assessment of the efficiency of using the computer's memory bandwidth of the developed algorithm was carried out in comparison with known implementations on the GPU. The purpose of this work is to detect and evaluate target parameters in real time using MIMO radar, using a commercially available computer with the minimum possible weight and size characteristics. To achieve the set research purpose, the following problems were solved: – selection and adaptation of algorithms that allow the assessment of target parameters in MIMO radar; – implementation of selected algorithms taking into account the architecture of the computer, allowing for an assessment of the target background situation in real time; – assessment of the performance of the resulting solution. In the process of developing an algorithm for digital processing of a MIMO radar signal, several options for implementing the algorithm were analyzed taking into account the architecture of a parallel computer, which made it possible to process a radio image frame consisting of 8 million complex samples in less than 50 ms. by NVIDIA Jetson AGXXavier GPU. The inverse relationship between frame processing time and the peak GPU memory bandwidth is shown. A criterion for evaluating the performance of the processing algorithm is proposed. A numerical assessment of the efficiency of using the computer's memory bandwidth of the developed algorithm was carried out in comparison with known implementations on the GPU. The gain of the developed algorithm is on average 5 times compared to the results obtained by other authors. Compared to an FPGA, implementing 2D FFT on a GPU is 17 times faster. The practical significance of the functional software developed by the authors does not impose any restrictions on the number of receiving and transmitting channels and can be used for signal processing in MIMO radars with a large number of channels.
References
Deep Net, Remote Sensing, 2022, Vol. 15, No. 1, pp. 1-21.
2. Lobach V.T., Kas'yanov A.O., Potipak M.V., Sumatokhin K.V. Osobennosti tsifrovogo formirovaniya
diagrammy napravlennosti MIMO-radara [Features of digital beamforming of MIMO radar], Antenny
[Antennas], 2016, No. 7, pp. 47-52.
3. Lobach V., Kasyanov A., Potipak M. Digital beam forming in MIMO-radar with frequency diversity,
2016 IEEE East-West Design & Test Symposium (EWDTS). IEEE. 14-17 Oct. 2016, pp. 1-4.
4. Karim B.A., Ali H.K. Computationally efficient MUSIC based DOA estimation algorithm for FMCW
radar, Journal of Electronic Science and Technology, 2023, Vol. 21, No. 1, pp. 1-18.
5. Singh H., Chattopadhyay A. Multi-target Range and Angle detection for MIMO-FMCW radar with
limited antennas, arXiv preprint arXiv:2302.14327, 2023.
6. Grove R.L. MIMO radar Systems and Algorithms-Imperfections and Calibration. Copenhagen. Technical
University of Denmark, 2022, 189 p.
7. Patrick D., Huang Y., Brennan P.W. FMCW based MIMO imaging radar, ARMMS Conference. 7-8
Apr. 2014, Vol. 1, pp. 12-34.
8. Eugin H., Lee J. Hardware architecture design and implementation for FMCW radar signal processing
algorithm, Proceedings of the 2014 Conference on Design and Architectures for Signal and Image
Processing. IEEE. 8-10 Oct. 2014, pp. 1-6.
9. Fixed-Point and Floating-Point FMCW Radar Signal Processing with Tensilica DSPs, White paper.
– 2022. Available at: https://www.cadence.com/content/dam/cadence-www/global/en_US/ documents/
tools/ip/tensilica-ip/tip-fmcw-radar-wp.pdf.
10. Pitre E. et al. Comparison of massively parallel algorithms on graphics processing unit for MIMO
radar, e-Prime-Advances in Electrical Engineering. Electronics and Energy, 2022, Vol. 2, pp. 1-9.
11. Liu G. et al. A GPU based real time processing system for frequency division multiple input multiple
output radar, IET Radar. Sonar & Navigation, 2023, Vol. 17, No. 10, pp. 1524-1537.
12. Heo J. et al. FPGA implementation of an efficient FFT processor for FMCW radar signal processing,
Sensors, 2021, Vol. 21, No. 19, pp. 1-16.
13. Damnjanović V.D., Petrović M.L., Milovanović V. M. On Hardware Implementations of Two-
Dimensional Fast Fourier Transform for Radar Signal Processing, IEEE EUROCON 2023-20th International
Conference on Smart Technologies. 6-8 Jul. 2023, pp. 400-405.
14. Texas Instruments. MIMO radar, Application Report SWRA554, 2018.
15. Doerry A.W. SAR ambiguous range suppression, Report SAND2006-5332. Sandia National Laboratories
(SNL). Albuquerque. NM. and Livermore. CA (United States), 2006.
16. Harris M. How to Access Global Memory Efficiently in CUDA C/C++ Kernels, Parallel Forall.
NVIDIA Corporation, 2013. Available at: https://developer.nvidia.com/blog/how-access-globalmemory-
efficiently-cuda-c-kernels/.
17. Luitjens J. Faster parallel reductions on Kepler, Parallel Forall. NVIDIA Corporation, 2014. Available
at: https://devblogs. nvidia. com/faster-parallel-reductions-kepler.
18. Sakharnykh N. GPUPro Tip: Fast histograms using shared atomics on Maxwell, Parallel Forall.
NVIDIA Corporation, 2015. Available at: https://developer.nvidia.com/blog/gpu-pro-tip-fasthistograms-
using-shared-atomics-maxwell/.
19. Kalentev O. et al. Connected component labeling on a 2D grid using CUDA, Journal of Parallel and
Distributed Computing, 2011, Vol. 71, No. 4, pp. 615-620.
20. Kas'yanov A.O. Potipak M.V., Evtushenko A.N. Identifikatsiya radiolokatsionnoy stantsii na osnove
analiza neprednamerennoy vnutriimpul'snoy modulyatsii [Identification of a radar station based on the
analysis of unintentional intrapulse modulation], Radiotekhnika [Radio engineering], 2023, Vol. 87,
No. 2, pp. 73-83.
21. Jia Z. et al. Dissecting the NVidia Turing T4 GPU via microbench marking, arXiv preprint arXiv:
1903.07486, 2019.