IssueGuard: Real-Time Secret Leak Prevention Tool for GitHub Issue Reports

GitHub and GitLab are widely used collaborative platforms whose issue-tracking systems contain large volumes of unstructured text, including logs, code snippets, and configuration examples. This creates a significant risk of accidentally exposing secrets such as API keys and credentials, yet these platforms provide no mechanism to warn users before submission. We present IssueGuard, a tool for real-time detection and prevention of secret leaks in issue reports. Implemented as a Chrome extension, IssueGuard analyzes text as users type and combines regex-based candidate extraction with a fine-tuned CodeBERT model for contextual classification. This approach effectively separates real secrets from false positives and achieves an F1-score of 92.70% on a benchmark dataset, outperforming traditional regex-based scanners. IssueGuard integrates directly into the web interface and continuously analyzes the issue editor, presenting clear visual warnings to help users avoid submitting sensitive data. The source code is publicly available at https://github.com/disa-lab/IssueGuard, and a demonstration video is available at https://youtu.be/kvbWA8rr9cU.
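
To make the two-stage idea concrete, the following is a minimal Python sketch of regex-based candidate extraction followed by a per-candidate classification step. It is an illustration under stated assumptions, not the IssueGuard implementation: the patterns, the context window size, the `Candidate` type, and all function names are hypothetical. The second stage here is a simple Shannon-entropy threshold standing in for the fine-tuned CodeBERT classifier, which would instead score each candidate together with its surrounding text.

```python
import math
import re
from typing import Callable, List, NamedTuple

# Hypothetical candidate patterns (assumption: the patterns IssueGuard
# actually uses are not listed in the abstract).
CANDIDATE_PATTERNS = {
    "aws_access_key_id": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "generic_assignment": re.compile(
        r"\b(?:api[_-]?key|secret|token|password)\b\s*[:=]\s*['\"]?"
        r"([A-Za-z0-9_\-+/=]{8,})",
        re.IGNORECASE,
    ),
}


class Candidate(NamedTuple):
    kind: str     # name of the pattern that matched
    value: str    # the suspected secret string
    context: str  # surrounding text a contextual model could inspect


def extract_candidates(text: str, window: int = 40) -> List[Candidate]:
    """Stage 1: collect every regex match plus a window of surrounding text."""
    candidates = []
    for kind, pattern in CANDIDATE_PATTERNS.items():
        for match in pattern.finditer(text):
            # Use the capture group when the pattern defines one.
            group = 1 if pattern.groups else 0
            start, end = match.span(group)
            context = text[max(0, start - window):end + window]
            candidates.append(Candidate(kind, match.group(group), context))
    return candidates


def shannon_entropy(s: str) -> float:
    """Entropy in bits per character of the string's character distribution."""
    counts = {ch: s.count(ch) for ch in set(s)}
    return -sum(n / len(s) * math.log2(n / len(s)) for n in counts.values())


def entropy_classifier(candidate: Candidate) -> bool:
    """Stage 2 stand-in (assumption): flag high-entropy values only.

    A contextual model would replace this and also use candidate.context.
    """
    return shannon_entropy(candidate.value) >= 3.0


def find_secrets(
    text: str,
    classify: Callable[[Candidate], bool] = entropy_classifier,
) -> List[Candidate]:
    """Run both stages and keep the candidates the classifier accepts."""
    return [c for c in extract_candidates(text) if classify(c)]
```

In this sketch the classifier is passed in as a plain callable, so a model-backed function could be swapped in without touching the extraction stage. The entropy threshold cannot use context, so it would still flag a documentation placeholder that merely looks random, which is the kind of false positive a contextual classifier is meant to filter out.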

