XPath is a language for addressing parts of an XML document. It is a W3C recommendation.
This article describes Mozilla interfaces exposing XPath functionality to JavaScript code. These are described in DOM Level 3 XPath (which is W3C Working Group Note at this moment).
This article does not attempt teach XPath itself. If you're unfamiliar with this technology, please refer to W3Schools XPath tutorial.
For a very simple XPath usage example, see: Code_snippets:HTML_to_DOM#Using_a_hidden_XUL_iframe_.28complete_example.29
Node-specific evaluator function
The following function can be used to evaluate XPath expressions on given XML nodes. The first argument is a DOM node or Document object, while the second is a string defining an XPath expression.
// Evaluate an XPath expression aExpression against a given DOM node // or Document object (aNode), returning the results as an array // thanks wanderingstan at morethanwarm dot mail dot com for the // initial work. function evaluateXPath(aNode, aExpr) { var xpe = new XPathEvaluator(); var nsResolver = xpe.createNSResolver(aNode.ownerDocument == null ? aNode.documentElement : aNode.ownerDocument.documentElement); var result = xpe.evaluate(aExpr, aNode, nsResolver, 0, null); var found = []; var res; while (res = result.iterateNext()) found.push(res); return found; }
This function uses new XPathEvaluator()
. That constructor is specific to Mozilla. Scripts used on a webpage which might be used by other browsers should instead replace the call to new XPathEvaluator()
with the following fragment:
// XPathEvaluator is implemented on objects that implement Document var xpe = aNode.ownerDocument || aNode;
In that case the creation of the XPathNSResolver can be simplified as:
var nsResolver = xpe.createNSResolver(xpe.documentElement);
Note however that createNSResolver
should only be used if you are sure the namespace prefixes in the XPath expression match those in the document you want to query (and that no default namespace is being used (though see DOM:document.createNSResolver for a workaround)). Otherwise, you have to provide your own implementation of XPathNSResolver.
If you are using XMLHttpRequest to read a local or remote XML file into a DOM tree (as described in Parsing and serializing XML), the first argument to evaluateXPath()
should be req.responseXML
.
Sample usage
Assume we have the following XML document (see also How to Create a DOM tree and Parsing and serializing XML):
<?xml version="1.0"?> <people> <person first-name="eric" middle-initial="H" last-name="jung"> <address street="321 south st" city="denver" state="co" country="usa"/> <address street="123 main st" city="arlington" state="ma" country="usa"/> </person> <person first-name="jed" last-name="brown"> <address street="321 north st" city="atlanta" state="ga" country="usa"/> <address street="123 west st" city="seattle" state="wa" country="usa"/> <address street="321 south avenue" city="denver" state="co" country="usa"/> </person> </people>
You can now "query" the document with XPath expressions. Although walking the DOM tree can achieve similar results, using XPath expressions is much quicker and more powerful. If you can rely on id
attributes, document.getElementById()
is still powerful, but it's not nearly as powerful as XPath. Here are some examples.
// display the last names of all people in the doc var results = evaluateXPath(people, "//person/@last-name"); for (var i in results) alert("Person #" + i + " has the last name " + results[i].value); // get the 2nd person node results = evaluateXPath(people, "/people/person[2]"); // get all the person nodes that have addresses in denver results = evaluateXPath(people, "//person[address/@city='denver']"); // get all the addresses that have "south" in the street name results = evaluateXPath(people, "//address[contains(@street, 'south')]"); alert(results.length);
docEvaluateArray
The following is a simple utility function to get (ordered) XPath results into an array, when there is no special need for namespace resolvers, etc. It avoids the more complex syntax of document.evaluate()
for cases when it is not required as well as the need to use the special iterators on XPathResult
(by returning an array instead).
// Example usage: // var els = docEvaluateArray('//a'); // alert(els[0].nodeName); // gives 'A' in HTML document with at least one link function docEvaluateArray (expr, doc, resolver) { if (!doc) { doc = document; } if (!resolver) { resolver = null; } var result = doc.evaluate(expr, doc, resolver, XPathResult.ORDERED_NODE_SNAPSHOT_TYPE, null); var a = []; for(var i = 0; i < result.snapshotLength; i++) { a[i] = result.snapshotItem(i); } return a; }
getXPathForElement
The following function allows one to pass an element and an XML document to find a unique string XPath expression leading back to that element.
function getXPathForElement(el, xml) { var xpath = ''; var pos, tempitem2; while(el !== xml.documentElement) { pos = 0; tempitem2 = el; while(tempitem2) { if (tempitem2.nodeType === 1 && tempitem2.nodeName === el.nodeName) { // If it is ELEMENT_NODE of the same name pos += 1; } tempitem2 = tempitem2.previousSibling; } xpath = "*[name()='"+el.nodeName+"' and namespace-uri()='"+(el.namespaceURI===null?'':el.namespaceURI)+"']["+pos+']'+'/'+xpath; el = el.parentNode; } xpath = '/*'+"[name()='"+xml.documentElement.nodeName+"' and namespace-uri()='"+(el.namespaceURI===null?'':el.namespaceURI)+"']"+'/'+xpath; xpath = xpath.replace(/\/$/, ''); return xpath; }
Resources
- XPath
- XPath Visualizer for Mozilla and Firefox
- XPath tutorial
- Forum discussion on this topic
- Using the Mozilla JavaScript Interface to XPath - draft tutorial on using XPath from javascript
- Sarissa - Sarissa is a cross-browser ECMAScript library for client side XML manipulation, including loading XML from URLs or strings, performing XSLT transformations, XPath queries and more. Supported: Gecko (Mozilla, Firefox etc), IE, KHTML (Konqueror, Safari). If you're writing JavaScript that is used in both XUL applications and HTML pages, and the HTML pages may be viewed in non-Gecko-based applications (such as Internet Explorer, Opera, Konqueror, Safari), you should consider using Sarissa to parse and/or serialize XML. Note: Do not create a DOM object using
document.implementation.createDocument()
and then use Sarissa classes and methods to manipulate that object. It will not work. You must use Sarissa to create the initial DOM object.