How to match on content within XML tags
search cancel

How to match on content within XML tags

book

Article ID: 160529

calendar_today

Updated On:

Products

Data Loss Prevention Enforce

Issue/Introduction

When searching XML files, I seem to be able to match only on values and not tags.
 
Removing the XML declaration (e.g. <?xml version="1.0" encoding="UTF-8"
?>) from the file allows the same regex to match on tags.

Resolution

Under the Monitors Advanced settings, check for the following;

There is the option ContentExtraction.MarkupAsText – turning that to “on” should allow us to match within tags, otherwise they are ignored.

That setting is also described in the Online Help under i.e. the below link:

https://techdocs.broadcom.com/us/en/symantec-security-software/information-security/data-loss-prevention/16-0/Manage-detection-servers/Installing-and-managing-detection-servers-and-cloud-detectors/advanced-server-settings-v23018357-d297e19396.html

With the following definition:

"Bypasses Content Extraction for files that are determined to be XML or HTML. This setting should be used in cases where web pages contain data in the header block or script blocks. Default is off."