Expand LLM08 embedding inversion section with 2025 research#13
Conversation
- Added ALGEN (ACL 2025) and ZSInvert (arXiv:2504.00147) findings - Updated recovery rate statistics (50-92% word recovery) - Added GDPR/HIPAA compliance framing - Added encryption and rate limiting mitigations (ref GenAI-Security-Project#9, GenAI-Security-Project#10) Signed-off-by: azizrebhi <154744962+azizrebhi@users.noreply.github.com>
S3DFX-CYBER
left a comment
There was a problem hiding this comment.
It's a good addition
But I would suggest a few refinements that would improve precision and alignment with OWASP guidance standards:
Some statements are currently too absolute (e.g., “works on any embedding”, “no longer needs to know which embedding model”). These should be softened to reflect broader applicability without making universal claims.
The tone in parts reads slightly promotional/research-heavy (“significantly lowered the barrier”, “first universal method”). Recommend shifting to more neutral, evidence-based phrasing.
The recovery rate claims (50–92%) are useful, but should be framed with context (e.g., “depending on conditions”) and avoid interpretive conclusions like “sufficient to reconstruct” without qualification.
The compliance statement is currently too definitive. OWASP entries should avoid strong legal assertions; suggest reframing as potential regulatory risk depending on context (e.g., GDPR, HIPAA).
The point about treating vector storage as equivalent to plaintext is insightful and worth keeping, but should be phrased more carefully (e.g., “should be treated as sensitive due to potential reconstruction risk”).
Expanded the discussion on vector and embedding vulnerabilities, including examples of risks and mitigation strategies. Enhanced clarity and detail throughout the document. Signed-off-by: azizrebhi <154744962+azizrebhi@users.noreply.github.com>
…, minor punctuation Reflowed the author's hard-wrapped paragraphs back to single-line format to match the rest of the 2026 directory style. Substantive content preserved verbatim: - 3 new paragraphs under '#### 3. Embedding Inversion Attacks' (ALGEN ACL 2025, ZSInvert arXiv:2504.00147, recovery rates, regulated-industry guidance). - New '#### 5. Encrypt embeddings at rest...' mitigation. - Minor punctuation fix: 'empathy(Ref GenAI-Security-Project#8).' -> 'empathy. (Ref GenAI-Security-Project#8)'.
|
Warning Edit 2026-05-02: This comment described an admin-merge that was made without entry-lead approval. The merge has been reverted in #19, and the original content has been reopened in #21 for proper review by @S3DFX-CYBER and @arshi016. The reflow I had applied is not in the new PR — entry leads will see the author's original wrapping. The original comment text is preserved below for transparency, but the actions it describes are no longer in effect.
|
These two PRs were admin-merged without first routing them through the LLM03 and LLM08 entry leads, which the project owner had wanted done before any merge. This commit restores both 2026/LLM03_*.md and 2026/LLM08_*.md to their state immediately before those merges landed (pre-PR#2 = 7350d2a, pre-PR#13 = beb58df). Once this revert is in, the original PR #2 and PR #13 content will be reopened as fresh PRs targeting main and the entry leads will be tagged for review. PR #17 and PR #11 are not affected — those merges were authorized in a separate decision.
|
@azizrebhi — apologies. This PR was admin-merged on 2026-05-02 without first routing through the LLM08 entry leads (@S3DFX-CYBER and @arshi016), which the project owner @rocklambros had wanted before any merge. That merge has been reverted in #19, and your content has been carried forward verbatim into #21 for proper review. The cherry-pick preserves you as the original commit author for both of your original commits. The paragraph reflow I had applied during the unauthorized merge is not in the new PR — the entry leads will see your original wrapping and can flag any style preferences during review. No action required on your end — @rocklambros will merge once the leads sign off. Thank you for your contribution and for the patience while we sort the workflow. |
Uh oh!
There was an error while loading. Please reload this page.