GenAI Flow

dessyordanova · dessyordanova · commit 52a5484e4d63 · 2025-08-06T09:05:12.000+03:00
diff --git a/_config.yml b/_config.yml
@@ -85,6 +85,9 @@ navigation:
     libraries/radwordsprocessing/editing/find-and-replace:
         title: Find and Replace
         position: 6
+    libraries/radwordsprocessing/features/gen-ai-powered-document-insights:
+        title: GenAI-powered Document Insights
+        position: 7
     libraries/radwordsprocessing/concepts:
         title: Concepts
         position: 6
diff --git a/libraries/radpdfprocessing/overview.md b/libraries/radpdfprocessing/overview.md
@@ -32,6 +32,8 @@ The API of RadPdfProcessing contains two different editors,  [RadFixedDocumentEd
 * Digital signatures
     * Signing a document with digital signature.
     * Validate digital signature of already signed document.
+* GenAI-powered Document Insights
+* Accessibility Support
             
 The document model of the library provides support for:
 
@@ -59,6 +61,7 @@ The document model of the library provides support for:
 |[**JavaScript Actions and Trigger Events**]({%slug radpdfprocessing-model-javascript-actions%})|As of Q4 2024 you can import or export the javascript actions associated with pages, form fields, etc. so that they can be executed when the exported document is opened with Adobe Acrobat. |
 |[**Accessibility Support**]({%slug create-accessible-pdf-documents%})|Offers accessibility support of documents to users with disabilities.|
 | [**Viewer Preferences**]({%slug radpdfprocessing-features-viewer-preferences%}) | Control how PDF documents are displayed and behave in PDF viewers, including window behavior, UI visibility, and print settings. |
+|**GenAI-powered Document Insights**|Enables you to easily extract insights from PDF documents using Large Language Models (LLMs). This functionality enables you to summarize document content and ask questions about it, with the AI providing relevant answers based on the document's content. [Read More]({%slug radpdfprocessing-features-gen-ai-powered-document-insights-overview%})|
 
 # See Also
 
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/complete-context-question-processor.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/complete-context-question-processor.md
@@ -0,0 +1,69 @@
+---
+title: CompleteContextQuestionProcessor
+description: CompleteContextQuestionProcessor class enables you to ask questions about a Word document and receive answers based on the entire document content.
+page_title: CompleteContextQuestionProcessor
+slug: radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor
+tags: ai, document, analysis, question, processor, complete, context
+published: True
+position: 5
+---
+<style>
+table, th, td {
+	border: 1px solid;
+}
+table th:first-of-type {
+	width: 30%;
+}
+table th:nth-of-type(2) {
+	width: 70%;
+} 
+</style>
+
+# CompleteContextQuestionProcessor
+
+The **CompleteContextQuestionProcessor** class enables you to ask questions about a Word document and receive answers based on the entire document content. This processor sends the complete document text to the AI model, which is suitable for smaller documents or when you need to ensure that the AI model has access to all the information in the document. This class inherits from the abstract **AIProcessorBase** class, which provides common functionality for all AI processors.
+
+The **CompleteContextQuestionProcessor** is ideal for the following scenarios:
+
+1. **Small Documents**: When the document is small enough to fit within the token limit of the AI model.
+2. **Holistic Understanding**: When the question requires understanding the entire document context.
+3. **Simplicity**: When you don't need the advanced embedding functionality of [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}).
+
+However, if you're working with larger documents or want to optimize token usage, you should use the [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#when-to-use-partialcontextquestionprocessor) instead.
+
+## Public API
+
+|Property|Description|
+|---|---|
+|**Settings**|Gets the settings for the AI question-answering process. Returns [CompleteContextProcessorSettings]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor%}#completecontextprocessorsettings).|
+
+|Method|Description|
+|---|---|
+|**public Task<string> AnswerQuestion(ISimpleTextDocument document, string question)**|Answers a question using the provided document. Parameters: **document** - The document containing the text to process, **question** - The question to answer. Returns a task that represents the asynchronous operation. The task result contains the answer to the question.|
+
+>caution **Security Warning:** The output produced by this API is generated by a Large Language Model (LLM). As such, the content should be considered untrusted and may include unexpected or unsafe data. It is strongly recommended to properly sanitize or encode all output before displaying it in a user interface, logging, or using it in any security-sensitive context.
+
+## CompleteContextProcessorSettings
+
+The **CompleteContextProcessorSettings** class provides configuration options for the question-answering process.
+
+### Settings Properties
+
+* **ModelMaxInputTokenLimit**: Gets or sets the maximum input token limit the model allows.
+* **TokenizationEncoding**: Gets or sets the tokenization encoding.
+* **ModelId**: Gets or sets the ID of the model.
+
+## Usage Example
+
+The following example demonstrates how to use the **CompleteContextQuestionProcessor** to ask questions about a Word document, including working with specific document pages. For setting up the AI client as shown in this example, see the [AI Provider Setup]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%}#ai-provider-setup) section:
+
+#### __[C#] Example 1: Using CompleteContextQuestionProcessor__
+
+<snippet id='libraries-flow-features-gen-ai-ask-questions-using-complete-context'/>
+
+## See Also
+
+* [GenAI-powered Document Insights Overview]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-overview%})
+* [Prerequisites]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%})
+* [SummarizationProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-summarization-processor%})
+* [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%})
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/getting-started.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/getting-started.md
@@ -0,0 +1,33 @@
+---
+title: Getting Started
+description: Learn how to use the GenAI-powered Document Insights functionality to summarize a Word document with WordsProcessing.
+page_title: Overview
+slug: radwordsprocessing-features-gen-ai-powered-document-insights-getting-started
+tags: ai, document, analysis, overview, word, flow, processing, genai, powered, insights
+published: True
+position: 2
+---
+
+# Getting Started
+
+The following example demonstrates how to use the GenAI-powered Document Insights functionality to summarize a Word document and ask questions about it:
+
+>note The following code snippet is valid for Azure Open AI 9.3. The specific **IChatClient** initialization may be different according to the specific version.
+
+>important For .NET {{site.mindotnetversion}}+ (Target OS Windows) with [Packages for .NET Framework and .NET {{site.mindotnetversion}} and .NET {{site.maxdotnetversion}} for Windows]({%slug available-nuget-packages%}#packages-for-net-framework-and-net-{{site.mindotnetversion}}-and-net-{{site.maxdotnetversion}}-for-windows), an [IEmbeddingsStorage]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#implementing-custom-iembeddingsstorage) implementation is required for the [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}).
+
+#### __[C#] Example 1: Using GenAI-powered Document Insights__
+
+<snippet id='libraries-flow-features-gen-ai-getting-started'/>
+
+When you run this code, the AI will process your document, generate a summary, and answer your questions.
+
+<!-- >note A sample runnable project is available in the Document Processing SDK: [AIConnectorDemo](https://github.com/telerik/document-processing-sdk/tree/master/WordsProcessing/AIConnectorDemo). -->
+
+## See Also
+
+* [Prerequisites]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%})
+* [SummarizationProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-summarization-processor%})
+* [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%})
+* [Custom IEmbeddingsStorage Implementation]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#implementing-custom-iembeddingsstorage)
+* [CompleteContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor%})
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/overview.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/overview.md
@@ -0,0 +1,36 @@
+---
+title: Overview
+description: Learn more about the GenAI-powered Document Insights feature of the WordsProcessing library. 
+page_title: Overview
+slug: radwordsprocessing-features-gen-ai-powered-document-insights-overview
+tags: ai, document, analysis, overview, word, processing, genai, powered, insights
+published: True
+position: 0
+---
+
+# GenAI-powered Document Insights Overview
+
+The GenAI-powered Document Insights feature enables you to easily extract insights from Word documents using Large Language Models (LLMs). This functionality allows you to summarize document content and ask questions about the document, with the AI providing relevant answers based on the document's content.
+
+## Key Features
+
+* **Extract Document Insights**: Quickly understand the key points of lengthy documents.
+* **Efficient Information Retrieval**: Ask specific questions about your documents and receive accurate answers.
+* **Token Optimization**: Reduce token usage by only sending relevant portions of the document to the AI model as shown in the [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#when-to-use-partialcontextquestionprocessor) section.
+* **Multiple LLM Support**: Compatible with different AI providers including Azure OpenAI, OpenAI, and Ollama as described in the [Prerequisites]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%}#ai-provider-setup).
+
+The GenAI-powered Document Insights feature includes three main components:
+
+|Processor|Description|
+|----|----|
+|**[SummarizationProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-summarization-processor%})**|Generates concise summaries of Word documents.|
+|**[CompleteContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor%})**|Answers questions by providing the entire document content to the AI model.|
+|**[PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%})**|Answers questions by providing only the relevant portions of the document to the AI model.|
+
+## See Also
+
+* [Prerequisites]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%})
+* [Getting Started]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-getting-started%})
+* [SummarizationProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-summarization-processor%})
+* [PartialContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%})
+* [CompleteContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor%})
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/partial-context-question-processor.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/partial-context-question-processor.md
@@ -0,0 +1,107 @@
+---
+title: PartialContextQuestionProcessor
+description: PartialContextQuestionProcessor class enables you to ask questions about a Word document and receive answers based on the most relevant parts of the document content.
+page_title: PartialContextQuestionProcessor
+slug: radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor
+tags: ai, document, analysis, question, processor, partial, context, embeddings
+published: True
+position: 4
+---
+<style>
+table, th, td {
+    border: 1px solid;
+}
+table th:first-of-type {
+    width: 65%;
+}
+table th:nth-of-type(2) {
+    width: 10%;
+}
+table th:nth-of-type(3) {
+    width: 25%;
+}
+</style>
+
+# PartialContextQuestionProcessor
+
+The **PartialContextQuestionProcessor** class enables you to ask questions about a Word document and receive answers based on the most relevant parts of the document content. This processor uses embeddings to identify and send only the relevant portions of the document to the AI model, making it more efficient for token usage and more suitable for large documents. This class inherits from the abstract **AIProcessorBase** class, which provides common functionality for all AI processors.
+
+The **PartialContextQuestionProcessor** is ideal for the following scenarios:
+
+1. **Large Documents**: When the document exceeds the token limit of the AI model and cannot be processed in a single call.
+2. **Efficient Token Usage**: When you want to minimize token consumption and optimize costs.
+3. **Specific Questions**: When questions are targeted at specific information within the document rather than requiring complete document understanding.
+
+## Public API and Configuration
+
+|Constructor|Platform|Description|
+|---|---|---|
+|**PartialContextQuestionProcessor(IChatClient chatClient, int modelMaxInputTokenLimit, ISimpleTextDocument document)**|_Specific*_ |Creates an instance with built-in embeddings storage|
+|**PartialContextQuestionProcessor(IChatClient chatClient, IEmbeddingsStorage embeddingsStorage, int modelMaxInputTokenLimit, ISimpleTextDocument document)**|Any|Creates an instance with custom embeddings storage|
+
+> _*Specific_ The .NET {{site.mindotnetversion}}+ (Target OS Windows) + [Packages for .NET Framework and .NET {{site.mindotnetversion}} and .NET {{site.maxdotnetversion}} for Windows]({%slug available-nuget-packages%}#packages-for-net-framework-and-net-{{site.mindotnetversion}}-and-net-{{site.maxdotnetversion}}-for-windows) constructor uses **DefaultEmbeddingsStorage** internally, while the cross-platform constructor requires a custom implementation of **IEmbeddingsStorage** as shown in the [Custom IEmbeddingsStorage Setup]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#implementing-custom-iembeddingsstorage) section.
+
+### Properties and Methods
+
+|Member|Type|Description|
+|---|---|---|
+|**Settings**|Property|Gets the **PartialContextProcessorSettings** for configuring the AI process|
+|**AnswerQuestion(string question)**|Method|Returns an answer to the question using relevant document context|
+
+>caution **Security Warning:** The output produced by this API is generated by a Large Language Model (LLM). As such, the content should be considered untrusted and may include unexpected or unsafe data. It is strongly recommended to properly sanitize or encode all output before displaying it in a user interface, logging, or using it in any security-sensitive context.
+
+### PartialContextProcessorSettings
+
+The settings class provides configuration options for the question-answering process:
+
+* **ModelMaxInputTokenLimit**: Maximum input token limit the model allows
+* **TokenizationEncoding**: Tokenization encoding used
+* **ModelId**: ID of the AI model
+* **MaxNumberOfEmbeddingsSent**: Maximum number of context chunks sent (default: 30)
+* **EmbeddingTokenSize**: Size in tokens of each context chunk (default: 300)
+
+## Usage Examples
+
+#### Example 1: Using PartialContextQuestionProcessor with default embeddings storage.
+
+This example demonstrates how to use the **PartialContextQuestionProcessor** with the built-in embeddings storage on .NET {{site.mindotnetversion}}+ (Target OS Windows) + [Packages for .NET Framework and .NET {{site.mindotnetversion}} and .NET {{site.maxdotnetversion}} for Windows]({%slug available-nuget-packages%}#packages-for-net-framework-and-net-{{site.mindotnetversion}}-and-net-{{site.maxdotnetversion}}-for-windows). For setting up the AI client, see the [AI Provider Setup]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%}#ai-provider-setup) section:
+
+<snippet id='libraries-flow-features-gen-ai-ask-questions-using-partial-context'/>
+
+#### Example 2: Using PartialContextQuestionProcessor with Custom Embeddings (.NET Standard/.NET Framework)
+
+This example demonstrates how to use the **PartialContextQuestionProcessor** with a custom embeddings storage implementation as described in the [Custom IEmbeddingsStorage Setup]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-partial-context-question-processor%}#implementing-custom-iembeddingsstorage) section:
+
+<snippet id='libraries-flow-features-gen-ai-ask-questions-using-partial-context-iembeddingsstorage'/>
+
+### Implementing custom IEmbeddingsStorage
+
+A sample custom implementation for the OllamaEmbeddingsStorage is shown in the below code snippet:
+
+>note Requires installing the following NuGet packages:
+> * **LangChain**
+> * **LangChain.Databases.Sqlite**
+> * **Microsoft.Extensions.AI.Ollama**
+> * **Telerik.Windows.Documents.AIConnector**
+> * **Telerik.Windows.Documents.Fixed**  
+
+1. Install Ollama from [ollama.com](https://ollama.com/).
+2. Pull the model you want to use.
+3. Start the Ollama server.
+
+<snippet id='libraries-pdf-features-gen-ai-ask-questions-using-partial-context-ollama-embeddings-storage'/>
+
+#### Example 3: Processing Specific Pages
+
+<snippet id='libraries-flow-features-gen-ai-summarize-process-specific-pages'/>
+
+#### Example 4: Optimizing Embeddings Settings
+
+<snippet id='libraries-flow-features-gen-ai-summarize-optimize-embeddings-storage'/>
+
+## See Also
+
+* [GenAI-powered Document Insights Overview]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-overview%})
+* [Prerequisites]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-prerequisites%})
+* [SummarizationProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-summarization-processor%})
+* [CompleteContextQuestionProcessor]({%slug radwordsprocessing-features-gen-ai-powered-document-insights-complete-context-question-processor%})
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/prerequisites.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/prerequisites.md
diff --git a/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/summarization-processor.md b/libraries/radwordsprocessing/editing/gen-ai-powered-document-insights/summarization-processor.md
diff --git a/libraries/radwordsprocessing/overview.md b/libraries/radwordsprocessing/overview.md