Process Intelligence Documentation

Handling Overlapping URLs in Screen Mapping

When defining screen mappings based on URLs, overlapping patterns can result in unintended page tagging. This document explains how to manage these overlaps by refining matching rules.

Example Scenario

Consider the following URLs in a screen mapping system:

  • example.domain.com/report

  • example.domain.com/reports-creator

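Because the first URL is a prefix of the second, a rule that matches on example.domain.com/report also matches example.domain.com/reports-creator. By default, that page may therefore be tagged with both "report" and "reports-creator". This is acceptable in some cases, but precise tagging is often required.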
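The overlap can be reproduced with a few lines of Python. This is only a sketch: the `patterns` dictionary and the `naive_tags` function are hypothetical names, and plain substring matching is an assumption about how the default behavior works, not a description of the product's internals.

```python
# Hypothetical screen mappings: tag name -> URL fragment to look for.
patterns = {
    "report": "example.domain.com/report",
    "reports-creator": "example.domain.com/reports-creator",
}


def naive_tags(url):
    """Return every tag whose pattern appears anywhere in the URL (assumed default)."""
    return [tag for tag, pattern in patterns.items() if pattern in url]


# The shorter pattern is a prefix of the longer one, so both tags apply here.
print(naive_tags("example.domain.com/reports-creator"))
# Only the shorter pattern matches this URL.
print(naive_tags("example.domain.com/report"))
```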

Solution: Adjusting Matching Rules

To prevent example.domain.com/reports-creator from being tagged as "report," modify the matching rule to explicitly exclude overlapping pages.

Updated Matching Rule

{
  "matching_criteria": {
    "rule_engine_rule": "url and not [w for w in context['in_url'] if w not in url] and not [w for w in context['not_in_url'] if w in url]",
    "context": {
      "in_url": [
        "example.domain.com/report"
      ],
      "not_in_url": [
        "reports-creator",
        "other-pages-with-overlap"
      ]
    }
  }
}

Explanation of the Rule

  • Requires the page URL to be non-empty and to contain every value listed in in_url.

  • Rejects any URL that contains a value listed in not_in_url.

  • The "context" section defines:

    • in_url: Strings that must all appear in the URL (example.domain.com/report).

    • not_in_url: Strings that must not appear in the URL (reports-creator, other-pages-with-overlap).
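To check a rule before deploying it, the expression can be approximated in plain Python. The sketch below assumes the rule string is evaluated with Python semantics, which its list-comprehension syntax suggests but this page does not state. The function name `rule_matches` is hypothetical and is not part of the product.

```python
def rule_matches(url, context):
    """Mirror the rule_engine_rule expression shown above (assumed Python semantics)."""
    # Entries from in_url that are missing from the URL; must be empty to match.
    missing = [w for w in context["in_url"] if w not in url]
    # Entries from not_in_url that are present in the URL; must be empty to match.
    excluded = [w for w in context["not_in_url"] if w in url]
    return bool(url) and not missing and not excluded


context = {
    "in_url": ["example.domain.com/report"],
    "not_in_url": ["reports-creator", "other-pages-with-overlap"],
}

for url in ("example.domain.com/report", "example.domain.com/reports-creator"):
    print(url, rule_matches(url, context))
```

Under these assumptions, the first URL satisfies the rule and the second does not, because it contains the excluded string reports-creator.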

Expected Outcome

  • example.domain.com/report is tagged as "report."

  • example.domain.com/reports-creator is tagged only as "reports-creator."

Alternative Approach

If multiple tags are acceptable, keep the default configuration, which tags example.domain.com/reports-creator with both labels.

Conclusion

Adjusting matching_criteria gives you control over how URLs are categorized and prevents unwanted overlapping tags.


Last updated 3 months ago
