The State of Secrets Sprawl 2025

Information Technology 2025-03-10 GitGuardian 杨框子

Key Takeaways and Data

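Credential leaks keep getting worse: in 2024, 23,770,171 new hardcoded credentials were added to public GitHub repositories, a 25% year-over-year increase. Generic credentials (such as usernames and passwords) accounted for 58% of them, making this the fastest-growing and most threatening category.
GitHub push protection has limited effect: although GitHub's push protection reduced leaks of specific credentials (such as OpenAI and GitHub App keys), leaks of generic credentials still rose significantly. Push protection cannot detect generic credentials, and it is free only for public repositories; private repositories require a paid plan.
Private repositories carry higher leak risk: private repositories are 8 times more likely than public ones to contain leaked credentials. Organizations may rely on "security through obscurity", assuming that credentials in private repositories are safe, but an attacker who gains access can easily move laterally between systems.
Collaboration tools are a new hotspot for leaks: credential leaks in collaboration tools such as Slack, Jira, or Confluence cannot be ignored. 38% of incidents there were classified as highly critical or urgent, compared with 31% in SCM tools. Developers handle credentials more casually in collaboration tools, and these tools lack the built-in security controls that SCMs have.
Docker Hub exposes a large number of valid credentials: an analysis of 15 million public images on Docker Hub found about 100,000 valid credentials exposed (including AWS keys, GCP keys, and GitHub tokens). Most leaks occur in image layers, and image tags also show credentials leaked through ENV, RUN, and ARG instructions.
AI tools increase the risk of credential leaks: repositories using GitHub Copilot have a credential leak rate 40% higher than public repositories overall. This suggests that AI tools raise developer productivity without improving code security.
Leaked credentials are not remediated promptly: 70% of the valid credentials detected in public repositories in 2022 were still valid in 2025. This indicates that organizations lack visibility into leaked credentials and fail to enforce credential lifecycle practices such as automatic expiry and rotation.
Secrets managers are not a cure-all: credentials still leak in environments that use secrets managers. Of 2,584 public repositories using a secrets manager, 132 leaked at least one credential (5.1%).
Excessive permissions amplify the impact of leaks: GitLab and GitHub tokens are widely over-permissioned. For example, 99% of GitLab API keys have either full access or read-only access, 96% of GitHub tokens have write access, and 95% have full access to repositories.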
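
The Docker Hub finding above names ENV and ARG instructions as a leak path. The following is a minimal Python sketch of how such instructions might be flagged in a Dockerfile before an image is built. It is not GitGuardian's detection method: the function name `find_suspect_instructions`, the keyword list, and the sample Dockerfile are all hypothetical, and the heuristic looks only at variable names, so it will miss secrets embedded in RUN commands or in image layers.

```python
import re

# Hypothetical heuristic: variable names that often hold credentials.
SECRET_NAME = re.compile(r"(?i)(password|passwd|secret|token|api_?key|access_?key)")

# Matches `ENV NAME=value`, `ENV NAME value`, and `ARG NAME=value`.
ASSIGNMENT = re.compile(
    r"^(ENV|ARG)\s+([A-Za-z_][A-Za-z0-9_]*)(?:=|\s+)(\S.*)$", re.IGNORECASE
)


def find_suspect_instructions(dockerfile_text: str) -> list[tuple[int, str, str]]:
    """Return (line_number, instruction, variable_name) for ENV/ARG lines
    that assign a literal value to a credential-like name."""
    findings = []
    for number, raw in enumerate(dockerfile_text.splitlines(), start=1):
        match = ASSIGNMENT.match(raw.strip())
        if not match:
            continue
        instruction, name, value = match.groups()
        # Skip values that only reference another variable, e.g. ${API_KEY}.
        if value.startswith("$"):
            continue
        if SECRET_NAME.search(name):
            findings.append((number, instruction.upper(), name))
    return findings


# Made-up sample input; the values are placeholders, not real credentials.
SAMPLE = """\
FROM python:3.12-slim
ARG BUILD_TOKEN=example-build-token
ENV APP_PORT=8080
ENV DB_PASSWORD example-password
ENV API_KEY=${API_KEY}
"""

for finding in find_suspect_instructions(SAMPLE):
    print(finding)
```

On the sample input, the sketch flags the ARG on line 2 and the ENV on line 4, and skips the ENV that only forwards a variable reference. A name-based check like this is deliberately narrow; it illustrates the leak path rather than replacing a real secrets scanner.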

Research Conclusions

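Credential leaks are growing more severe and more varied: generic credentials, collaboration tools, container images, and AI tools are the new hotspots.
Organizations need a comprehensive response: beyond stronger detection, they must pay attention to credential remediation, permission management, credential lifecycle management, and the integration of secrets managers.
Managing non-human identities (NHIs) and their credentials is a key challenge: organizations should make credential remediation a key security objective, provide comprehensive developer training, automate credential rotation and revocation, and centralize the management of secrets managers.
Risk varies by credential type: organizations need different response strategies for different types of leaked credentials, and should recognize that even credentials that seem to leak rarely can carry high risk.
The focus must shift from simple credential detection to full credential lifecycle management and rapid incident response: organizations need a sound security program to counter the ongoing threat of credential leaks.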
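
The conclusions above call for automated rotation as part of credential lifecycle management. As a rough illustration, the Python sketch below flags credentials that have outlived a maximum age. The `Credential` record, the `overdue_for_rotation` helper, the inventory entries, and the 90-day threshold are all assumptions made for this example; the report does not prescribe a rotation interval.

```python
from dataclasses import dataclass
from datetime import date, timedelta


@dataclass(frozen=True)
class Credential:
    name: str        # hypothetical identifier for the credential
    issued_on: date  # date the credential was created or last rotated


def overdue_for_rotation(creds, today, max_age=timedelta(days=90)):
    """Return the names of credentials older than max_age (assumed policy)."""
    return [c.name for c in creds if today - c.issued_on > max_age]


# Made-up inventory for illustration only.
inventory = [
    Credential("ci-deploy-token", date(2022, 6, 1)),
    Credential("reporting-db-password", date(2025, 1, 15)),
]

print(overdue_for_rotation(inventory, today=date(2025, 3, 10)))
```

With these dates, only `ci-deploy-token` is reported, since the database password is 54 days old. In practice the issue dates would come from a secrets manager or identity provider rather than a hand-written list.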

Table of Contents

AI-Enhanced Detection: Revealing the Full Scope of Credential Exposure
58% of All Detected Secrets Are Generic
GitHub’s Push Protection: A Promising Initiative, But Not a Silver Bullet
Private Repositories 8 Times More Likely To Contain Secrets
Fastest Growing Services
Mapping the SDLC: Where Leaks Happen
Collaboration Tools: The Overlooked Frontier of Secrets Sprawl
100,000+ Valid Secrets on Docker Hub
Copilot increases secrets incidence rate by 40%
Secrets Managers: Not a Complete Solution
Excessive Permissions Make Secret Leaks More Severe
Bridging the remediation gap
Understanding the Impact: Real-World Risks of Secrets Sprawl
About GitGuardian
Methodology

The State of Secrets Sprawl 2025

Data analysis by GitGuardian

15% of commit authors leaked a secret
70% of valid secrets detected in public repositories in 2022 remain active today
4.6% of all public repositories contain a secret
35% of all private repositories contain hardcoded secrets
38% of incidents in collaboration and project management tools (Slack, Jira or Confluence) were classified as highly critical or urgent, compared to 31% in Source Code Management Systems (SCMs)

From day one, GitGuardian has been committed to protecting developer environments from secrets sprawl, a dedication that has established us as the #1 application on GitHub Marketplace. For over seven years, our real-time scanning of public GitHub events through our Good Samaritan program has enabled us to proactively notify developers when credentials are exposed. In 2024 alone, we sent 1.9 million pro bono alert emails to developers who inadvertently leaked sensitive credentials.

How Leaky Was 2024

Long-lived plaintext credentials have been involved in most breaches over the last several years. When valid credentials, such as API keys, passwords, and authentication tokens, leak, attackers at any skill level can gain initial access or perform rapid lateral movement through systems.
In 2024, we found 23,770,171 new hardcoded secrets added to public GitHub repositories. This figure represents a 25% surge in the total number of secrets from the previous year. This marks a substantial increase in the number of secrets found and continues the disturbing trend: secrets sprawl is steadily worsening over time.

Despite GitHub’s efforts to prevent certain credential leaks during the push stage, which did indeed reduce incidents involving specific secrets (secrets following known patterns for specific services), the platform’s measures have not effectively addressed the growing prevalence of generic secrets. It is within this category that we observed the most significant year-over-year surge in plaintext credentials.

The danger of the continued rise of secrets leakage is very real. Over the past 10 years, stolen credentials have been used in 31% of all breaches, according to Verizon’s 2024 Data Breach Investigations Report. It is an attacker’s favorite way to gain an initial foothold and to move laterally through environments. At the same time, IBM’s Cost of a Data Breach report makes it clear how time-consuming this issue is for the enterprise. Breaches involving stolen or compromised credentials take an average of 292 days to identify and remediate, more than any other attack vector.

AI-Enhanced Detection: Revealing the Full Scope of Credential Exposure

The 2025 State of Secrets Sprawl report marks a significant milestone in secrets detection, unveiling a more comprehensive picture of the secrets sprawl landscape. For the first time, thanks to our innovative machine learning models, such as the one powering False Positive Remover, GitGuardian can now confidently identify and validate more generic secrets.

Historically, GitGuardian took a conservative stance on generic secrets to avoid a large number of potential false positive results.
Our secrets detection engine was intentionally calibrated for high precision, ensuring that when a secret was flagged, it was almost certainly a real secret. Any doubt meant leaving it out.

Our past focus was concentrated on the most commonly used enterprise-specific secrets, such as API keys and service-specific credentials, but these are just the tip of the iceberg. The true magnitude of the secrets sprawl problem lies in the vast ocean of generic secrets, such as usernames & passwords and unstructured credentials.

As an example, here’s a Base64 basic auth string:

"Authorization": "Basic aW50ZXJuc2hpcDpjZGk="

Or an example of a database credential:

connect_to_db(host="136.12.43.86", port=8130, username="root", password="m42ploz2wd")

This ML-driven shift not only enables us to find more secrets but also helps us categorize them much more effectively. Doing so, we ensure they are genuine secrets, strengthening both recall and precision. The result provides a more accurate, holistic understanding of how and where secrets are spreading.

The Department of The Treasury breach

In December 2024, Chinese state-
