Skip to content

[Feature Request]: Analysis for document delete support approach #2078

@gona-sreelatha

Description

@gona-sreelatha

Do you need to file an issue?

  • I have searched the existing issues and this feature is not already filed.
  • My model is hosted on OpenAI or Azure. If not, please look at the "model providers" issue and don't file a new one here.
  • I believe this is a legitimate feature request, not just a question. If this is a question, please use the Discussions area.

Is your feature request related to a problem? Please describe.

I’m planning to extend GraphRAG to support delete operations by introducing a custom workflow.

Proposed Approach

  1. Delete the target document.
  2. Delete its associated text_units.
  3. Remove references to those text_units from entities and relationships.
    •  If an entity/relationship has other text_unit references, update it accordingly.
      
    • If the entity/relationship is only linked to the deleted text_units, remove it entirely.
      
  4. Re-run community generation and community summarization to ensure consistency.

Questions

  1. Would this approach be sufficient to properly handle deletes?
  2. Are there any pitfalls or edge cases I should watch out for?
  3. Do you have any alternative suggestions or best practices for implementing deletes in GraphRAG?

I would deeply appreciate any inputs, suggestions, and thoughts from maintainers and community.
Thanks for your input!

@natoverse @AlonsoGuevara @darthtrevino

Describe the solution you'd like

No response

Additional context

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions