An experience with PyCUDA: Refactoring an existing implementation of a ray-surface intersection algorithm - 专知论文

会员服务 ·

0

CUDA · 代码 · SequeL · 调试策略 · binary ·

2023 年 5 月 3 日

An experience with PyCUDA: Refactoring an existing implementation of a ray-surface intersection algorithm

翻译：PyCUDA使用经验：基于光线-表面相交算法的现有实现重构

from arxiv, 14 pages. Keywords: PyCUDA, Python scripting, GPU Run-Time Code Generation (RTCG), ray-mesh intersection, open-source code, learning, shared experience

This article is a sequel to "GPU implementation of a ray-surface intersection algorithm in CUDA" (arXiv:2209.02878) [1]. Its main focus is PyCUDA which represents a Python scripting approach to GPU run-time code generation in the Compute Unified Device Architecture (CUDA) framework. It accompanies the open-source code distributed in GitHub which provides a PyCUDA implementation of a GPU-based line-segment, surface-triangle intersection test. The objective is to share a PyCUDA learning experience with people who are new to PyCUDA. Using the existing CUDA code and foundation from [1] as the starting point, we document the key changes made to facilitate a transition to PyCUDA. As the CUDA source for the ray-surface intersection test contains both host and device code and uses multiple kernel functions, these notes offer a substantive example and real-world perspective of what it is like to utilize PyCUDA. It delves into custom data structures such as binary radix tree and highlights some possible pitfalls. The case studies present a debugging strategy which may be used to examine complex C structures in device memory using standard Python tools without the CUDA-GDB debugger.

翻译：本文是《CUDA框架下光线-表面相交算法的GPU实现》（arXiv:2209.02878）[1]的续篇。重点聚焦于PyCUDA——这一通过Python脚本在统一计算设备架构（CUDA）框架中实现GPU运行时代码生成的方法。文章配合GitHub上分发的开源代码，提供了基于GPU的线段-表面三角形相交测试的PyCUDA实现。目的在于为PyCUDA初学者分享学习经验。以现有CUDA代码及文献[1]中的基础为起点，我们记录为促进向PyCUDA迁移所做的主要修改。由于光线-表面相交测试的CUDA源码同时包含主机与设备代码，并使用多个内核函数，本文档提供了利用PyCUDA的真实案例与业界视角。深入探讨了诸如二叉树等自定义数据结构，并指出了若干潜在陷阱。案例研究提出了一种调试策略，可借助标准Python工具（无需CUDA-GDB调试器）检查设备内存中的复杂C语言结构。

0

相关内容

CUDA

【2023新书】使用Python进行统计和数据可视化，554页pdf

【2023新书】使用Python进行统计和数据可视化，554页pdf

专知会员服务

130+阅读 · 2023年1月29日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

109+阅读 · 2020年5月1日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

专知

10+阅读 · 2019年1月11日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

机器学习研究会

11+阅读 · 2017年12月5日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

二氧化碳加氢合成甲酸纳米金催化剂的构建

国家自然科学基金

0+阅读 · 2016年12月31日

肥胖相关Hepatokine LECT2在肝脏中的调控及机制

国家自然科学基金

1+阅读 · 2015年12月31日

SMAD2调控ERK通路干预M2巨噬细胞活化在糖尿病肾病小鼠肾脏纤维化中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

锂离子电池电极材料性能调控的界面效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白激酶Wts/Lats的稳定性调控

国家自然科学基金

0+阅读 · 2014年12月31日

奶牛乳腺脂类合成代谢转录调控机制与基因网络构建

国家自然科学基金

0+阅读 · 2014年12月31日

氧（氮）桥联杯芳烃配位组装体的合成、结构和功能

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

深海放线菌Streptomyces sp. SCSIO 03032抗肿瘤天然产物Spiroindimicins生物合成研究

国家自然科学基金

0+阅读 · 2012年12月31日

富含半胱氨酸的酸性分泌蛋白SPARC在胃癌细胞中的表达和调控

国家自然科学基金

0+阅读 · 2009年12月31日

Numerical Approximations of a Class of Nonlinear Second-Order Boundary Value Problems using Galerkin-Compact Finite Difference Method

Arxiv

0+阅读 · 2023年6月16日

MementoHash: A Stateful, Minimal Memory, Best Performing Consistent Hash Algorithm

Arxiv

0+阅读 · 2023年6月16日

Live Exploration of AI-Generated Programs

Arxiv

0+阅读 · 2023年6月15日

UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video

Arxiv

0+阅读 · 2023年6月15日

Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes

Arxiv

0+阅读 · 2023年6月15日

Modernising the Design and Analysis of Prevalence Surveys for Neglected Tropical Diseases

Arxiv

0+阅读 · 2023年6月14日

Understanding the Role of Human Intuition on Reliance in Human-AI Decision-Making with Explanations

Arxiv

0+阅读 · 2023年6月14日

From Driver to Supervisor: Comparing Cognitive Load and EEG-based Attention Allocation across Automation Levels

Arxiv

0+阅读 · 2023年6月14日

Reinforcement Learning-Based Control of CrazyFlie 2.X Quadrotor

Arxiv

0+阅读 · 2023年6月14日

Recent Advances of Continual Learning in Computer Vision: An Overview

Recent Advances of Continual Learning in Computer Vision: An Overview

Arxiv

23+阅读 · 2021年9月23日

VIP会员

文章信息

相关主题

最新内容

“史诗怒火”行动：现代多域作战的重要节点

“史诗怒火”行动：现代多域作战的重要节点

专知会员服务

5+阅读 · 今天5:05

《下一代无线网络中的多无人机通信资源管理》

《下一代无线网络中的多无人机通信资源管理》

专知会员服务

4+阅读 · 今天5:00

《高分辨率模拟下的聚合战斗建模：以“会战交锋”场景为例》

《高分辨率模拟下的聚合战斗建模：以“会战交锋”场景为例》

专知会员服务

5+阅读 · 今天4:52

《人机协同在安全关键型操作决策中的应用》120页

《人机协同在安全关键型操作决策中的应用》120页

专知会员服务

3+阅读 · 今天4:43

网络防御与空中力量网络防护：21世纪空中力量历史与理论的启示

网络防御与空中力量网络防护：21世纪空中力量历史与理论的启示

专知会员服务

3+阅读 · 今天1:47

综述 | Memory for Large Language Models：大模型记忆机制全景

综述 | Memory for Large Language Models：大模型记忆机制全景

专知会员服务

6+阅读 · 7月29日

博士论文 | Riemannian Deep Learning：模块、网络与几何

博士论文 | Riemannian Deep Learning：模块、网络与几何

专知会员服务

3+阅读 · 7月29日

《越野作战环境下路径规划的多准则整数规划模型》

《越野作战环境下路径规划的多准则整数规划模型》

专知会员服务

9+阅读 · 7月29日

人工智能大语言模型引擎如何重塑全球冲突信息环境最新50页

人工智能大语言模型引擎如何重塑全球冲突信息环境最新50页

专知会员服务

7+阅读 · 7月29日

《防空系统对自主武器系统辩论中“有意义的人类控制”的启示》70页报告

《防空系统对自主武器系统辩论中“有意义的人类控制”的启示》70页报告

专知会员服务

6+阅读 · 7月29日

“对标ChatGPT”：乌军研发Marichka AI系统用于战场筹划

“对标ChatGPT”：乌军研发Marichka AI系统用于战场筹划

专知会员服务

10+阅读 · 7月29日

《同步多无人机系统中的故障与通信》

《同步多无人机系统中的故障与通信》

专知会员服务

4+阅读 · 7月29日

论文解读 | 医学图像修复中的扩散模型：挑战、分类与未来方向

论文解读 | 医学图像修复中的扩散模型：挑战、分类与未来方向

专知会员服务

5+阅读 · 7月28日

博士论文 | 从算法到基础模型：强化学习的统一视角

博士论文 | 从算法到基础模型：强化学习的统一视角

专知会员服务

11+阅读 · 7月28日

面向国防作战的最佳自主与蜂群无人机技术

面向国防作战的最佳自主与蜂群无人机技术

专知会员服务

7+阅读 · 7月28日

相关VIP内容

【2023新书】使用Python进行统计和数据可视化，554页pdf

【2023新书】使用Python进行统计和数据可视化，554页pdf

专知会员服务

130+阅读 · 2023年1月29日

计算机科学课程与视频课件合集，Computer Science courses with video lectures

计算机科学课程与视频课件合集，Computer Science courses with video lectures

专知会员服务

37+阅读 · 2022年1月24日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

【2020新书】自然语言处理Python与spaCy实践，216页pdf，NLP with Python

专知会员服务

109+阅读 · 2020年5月1日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

164+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《下一代无线网络中的多无人机通信资源管理》

《人机协同在安全关键型操作决策中的应用》120页

“史诗怒火”行动：现代多域作战的重要节点

《高分辨率模拟下的聚合战斗建模：以“会战交锋”场景为例》

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

安装TensorFlow 2.0 preview进行深度学习（附Jupyter Notebook）

专知

10+阅读 · 2019年1月11日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

44+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

利用动态深度学习预测金融时间序列基于Python

利用动态深度学习预测金融时间序列基于Python

量化投资与机器学习

18+阅读 · 2018年10月30日

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

【推荐】(TensorFlow)SSD实时手部检测与追踪（附代码）

机器学习研究会

11+阅读 · 2017年12月5日

【推荐】用Python/OpenCV实现增强现实

【推荐】用Python/OpenCV实现增强现实

机器学习研究会

15+阅读 · 2017年11月16日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】YOLO实时目标检测(6fps)

【推荐】YOLO实时目标检测(6fps)

机器学习研究会

20+阅读 · 2017年11月5日

【推荐】用Tensorflow理解LSTM

【推荐】用Tensorflow理解LSTM

机器学习研究会

36+阅读 · 2017年9月11日

相关论文

Numerical Approximations of a Class of Nonlinear Second-Order Boundary Value Problems using Galerkin-Compact Finite Difference Method

Arxiv

0+阅读 · 2023年6月16日

MementoHash: A Stateful, Minimal Memory, Best Performing Consistent Hash Algorithm

Arxiv

0+阅读 · 2023年6月16日

Live Exploration of AI-Generated Programs

Arxiv

0+阅读 · 2023年6月15日

UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video

Arxiv

0+阅读 · 2023年6月15日

Causal Modeling of Policy Interventions From Sequences of Treatments and Outcomes

Arxiv

0+阅读 · 2023年6月15日

Modernising the Design and Analysis of Prevalence Surveys for Neglected Tropical Diseases

Arxiv

0+阅读 · 2023年6月14日

Understanding the Role of Human Intuition on Reliance in Human-AI Decision-Making with Explanations

Arxiv

0+阅读 · 2023年6月14日

From Driver to Supervisor: Comparing Cognitive Load and EEG-based Attention Allocation across Automation Levels

Arxiv

0+阅读 · 2023年6月14日

Reinforcement Learning-Based Control of CrazyFlie 2.X Quadrotor

Arxiv

0+阅读 · 2023年6月14日

Recent Advances of Continual Learning in Computer Vision: An Overview

Recent Advances of Continual Learning in Computer Vision: An Overview

Arxiv

23+阅读 · 2021年9月23日

相关基金

二氧化碳加氢合成甲酸纳米金催化剂的构建

国家自然科学基金

0+阅读 · 2016年12月31日

肥胖相关Hepatokine LECT2在肝脏中的调控及机制

国家自然科学基金

1+阅读 · 2015年12月31日

SMAD2调控ERK通路干预M2巨噬细胞活化在糖尿病肾病小鼠肾脏纤维化中的作用及机制

国家自然科学基金

0+阅读 · 2015年12月31日

锂离子电池电极材料性能调控的界面效应研究

国家自然科学基金

0+阅读 · 2014年12月31日

蛋白激酶Wts/Lats的稳定性调控

国家自然科学基金

0+阅读 · 2014年12月31日

奶牛乳腺脂类合成代谢转录调控机制与基因网络构建

国家自然科学基金

0+阅读 · 2014年12月31日

氧（氮）桥联杯芳烃配位组装体的合成、结构和功能

国家自然科学基金

0+阅读 · 2012年12月31日

Arisandilactone A 的不对称全合成

国家自然科学基金

0+阅读 · 2012年12月31日

深海放线菌Streptomyces sp. SCSIO 03032抗肿瘤天然产物Spiroindimicins生物合成研究

国家自然科学基金

0+阅读 · 2012年12月31日

富含半胱氨酸的酸性分泌蛋白SPARC在胃癌细胞中的表达和调控

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员