Getting Python Types Right with RightTyper

Juan Altmayer Pizzorno, Emery D. Berger

Python type annotations enable static type checking, but most code remains untyped because manual annotation is time-consuming and tedious. Past approaches to automatic type inference fall short: static methods struggle with dynamic features and infer overly broad types; AI-based methods are unsound and miss rare types; and dynamic methods impose extreme overheads (up to 270x), lack important language support such as inferring variable types, or produce annotations that cause runtime errors. This paper presents RightTyper, a novel hybrid approach for Python that produces accurate and precise type annotations. RightTyper grounds inference in types observed during actual program execution and combines these observations with static analysis and name resolution to produce substantially higher-quality type annotations than prior approaches. Through principled, statistically guided adaptive sampling, RightTyper balances runtime overhead against the need to observe enough execution behavior to infer high-quality type annotations. We evaluate RightTyper against static, dynamic, and AI-based systems on both synthetic benchmarks and real-world code, and find that it consistently achieves higher semantic similarity to ground-truth annotations (on the synthetic benchmarks) and to developer-written annotations (on the real-world code), while incurring only approximately 27% runtime overhead.
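
To make the idea of grounding annotations in observed execution more concrete, the following is a minimal sketch, not RightTyper's implementation or API. It assumes a hypothetical `TypeObserver` class that wraps functions, records the runtime types of arguments and return values on a sampled subset of calls, and renders them as a signature. The fixed `sample_rate` is a deliberate simplification standing in for the statistically guided adaptive sampling the abstract describes, and the sketch does none of the static analysis or name resolution.

```python
import random
from collections import defaultdict
from functools import wraps


def _type_name(value):
    # Annotations spell the type of None as "None", not "NoneType".
    return "None" if value is None else type(value).__name__


class TypeObserver:
    """Hypothetical helper: records runtime types on a sampled subset of calls."""

    def __init__(self, sample_rate=1.0, seed=0):
        # Fixed sampling probability (an assumption; the paper's sampling is adaptive).
        self.sample_rate = sample_rate
        self._rng = random.Random(seed)
        # function name -> slot (parameter name or "return") -> set of type names
        self.observed = defaultdict(lambda: defaultdict(set))
        self._params = {}

    def watch(self, func):
        code = func.__code__
        params = code.co_varnames[:code.co_argcount]
        self._params[func.__name__] = params

        @wraps(func)
        def wrapper(*args, **kwargs):
            result = func(*args, **kwargs)
            # Only pay the recording cost on sampled calls.
            if self._rng.random() < self.sample_rate:
                slots = self.observed[func.__name__]
                bound = dict(zip(params, args))
                bound.update(kwargs)
                for name, value in bound.items():
                    slots[name].add(_type_name(value))
                slots["return"].add(_type_name(result))
            return result

        return wrapper

    def signature(self, name):
        """Render the observed types as a PEP 604 style signature string."""
        slots = self.observed.get(name, {})
        parts = []
        for param in self._params[name]:
            types = slots.get(param)
            # Parameters never observed stay unannotated rather than guessed.
            parts.append(f"{param}: {' | '.join(sorted(types))}" if types else param)
        text = f"def {name}({', '.join(parts)})"
        if slots.get("return"):
            text += f" -> {' | '.join(sorted(slots['return']))}"
        return text


observer = TypeObserver(sample_rate=1.0)


@observer.watch
def scale(x, factor=2):
    return x * factor


scale(3)
scale(1.5, factor=2)
print(observer.signature("scale"))
# prints: def scale(x: float | int, factor: int) -> float | int
```

Because the sketch only reports types it has actually seen, lowering `sample_rate` trades recording cost for the risk of missing rarely occurring types, which is the tension the abstract says adaptive sampling is meant to balance.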

Getting Python Types Right with RightTyper · arXiv · February 3
