Stable
Create a permanent, invisible page and access its DOM.
Usage
The module exports a constructor function Page
, which constructs a new page worker. A page worker may be destroyed, after which its memory is freed, and you must create a new instance to load another page.
You specify the page to load using the contentURL
option to the Page()
constructor. This can point to a remote file:
pageWorker = require("sdk/page-worker").Page({ contentScript: "console.log(document.body.innerHTML);", contentURL: "https://en.wikipedia.org/wiki/Internet" });
It can also point to an HTML file which you've packaged with your add-on. To do this, save the file in your add-on's data
directory and create the URL using the data.url()
method of the self
module:
pageWorker = require("sdk/page-worker").Page({ contentScript: "console.log(document.body.innerHTML);", contentURL: require("sdk/self").data.url("myFile.html") });
From Firefox 34, you can use "./myFile.html"
as an alias for self.data.url("myFile.html")
. So you can rewrite the above code like this:
pageWorker = require("sdk/page-worker").Page({ contentScript: "console.log(document.body.innerHTML);", contentURL: "./myFile.html" });
You can load a new page by setting the page worker's contentURL
property. In this example we fetch the first paragraph of a page from Wikipedia, then the first paragraph of a different page:
var getFirstParagraph = "var paras = document.getElementsByTagName('p');" + "console.log(paras[0].textContent);" + "self.port.emit('loaded');" pageWorker = require("sdk/page-worker").Page({ contentScript: getFirstParagraph, contentURL: "https://en.wikipedia.org/wiki/Chalk" }); pageWorker.port.on("loaded", function() { pageWorker.contentURL = "https://en.wikipedia.org/wiki/Cheese" });
Scripting page content
To access the page's DOM you need to attach a script to it. In the SDK these scripts are called "content scripts" because they're explicitly used for interacting with web content.
You can specify one or more content scripts to load into the page using the contentScript
or contentScriptFile
options to the Page()
constructor. With contentScript
you pass the script as a string, as in the examples above. With contentScriptFile
you pass a URL which points to a script saved under your add-on's data
directory. You construct the URL using the data.url()
method of the self
module.
While content scripts can access DOM content, they can't access any of the SDK APIs, so in many cases you'll need to exchange messages between the content script and your main add-on code for a complete solution.
For example, the content script might read some content and send it back to the main add-on, which could store it using the simple-storage
API. You can communicate with the script using either the postMessage()
API or (preferably, usually) the port
API.
For example, this add-on loads a page from Wikipedia, and runs a content script in it to send all the headers back to the main add-on code:
var pageWorkers = require("sdk/page-worker"); // This content script sends header titles from the page to the add-on: var script = "var elements = document.querySelectorAll('h2 > span'); " + "for (var i = 0; i < elements.length; i++) { " + " postMessage(elements[i].textContent) " + "}"; // Create a page worker that loads Wikipedia: pageWorkers.Page({ contentURL: "https://en.wikipedia.org/wiki/Internet", contentScript: script, contentScriptWhen: "ready", onMessage: function(message) { console.log(message); } });
For conciseness, this example creates the content script as a string and uses the contentScript
property. In your own add-ons, you will probably want to create your content scripts in separate files and pass their URLs using the contentScriptFile
property.
Unless your content script is extremely simple and consists only of a static string, don't use contentScript
: if you do, you may have problems getting your add-on approved on AMO.
Instead, keep the script in a separate file and load it using contentScriptFile
. This makes your code easier to maintain, secure, debug and review.
To learn much more about content scripts, see the Working with Content Scripts guide.
Scripting trusted page content
We've already seen that you can package HTML files in your add-on's data
directory and load them using page-worker
. We can call this "trusted" content, because unlike content loaded from a source outside the add-on, the add-on author knows exactly what it's doing. To interact with trusted content you don't need to use content scripts: you can just include a script from the HTML file in the normal way, using <script>
tags.
Like a content script, these scripts can communicate with the add-on code using the postMessage()
API or the port
API. The crucial difference is that these scripts access the postMessage
and port
objects through the addon
object, whereas content scripts access them through the self
object.
So given an add-on that loads trusted content and uses content scripts to access it, there are typically three changes you have to make, if you want to use normal page scripts instead:
-
in the content script: change occurrences of
self
toaddon
. For example,self.port.emit("my-event")
becomesaddon.port.emit("my-event")
. -
in the HTML page itself: add a
<script>
tag to load the script. So if your content script is saved underdata
as "my-script.js", you need a line like<script src="my-script.js"></script>
in the page header. -
in the "main.js" file: remove the
contentScriptFile
option in thePage()
constructor.
Globals
Constructors
Page(options)
Creates an uninitialized page worker instance.
Parameters
options : object
Optional options:
Name | Type | |
---|---|---|
contentURL | string |
The URL of the content to load in the panel. |
allow | object |
An object with keys to configure the permissions on the page worker. The boolean key |
include |
string, array of (String or RegExp) |
This is useful when your page worker loads a page which will redirect to other pages.
A match pattern string or an array of match pattern strings. These define the documents to which the page-worker's content worker applies. At least one match pattern must be supplied. See the match-pattern module for a detailed description of match pattern syntax. |
contentScriptFile | string,array |
A local file URL or an array of local file URLs of content scripts to load. Content scripts specified by this option are loaded before those specified by the |
contentScript | string,array |
A string or an array of strings containing the texts of content scripts to load. Content scripts specified by this option are loaded after those specified by the |
contentScriptWhen | string |
When to load the content scripts. This may take one of the following values:
This property is optional and defaults to "end". |
contentScriptOptions | object |
Read-only value exposed to content scripts under Any kind of jsonable value (object, array, string, etc.) can be used here. Optional. |
onMessage | function |
Use this to add a listener to the page worker's |
Page
A Page
object loads the page specified by its contentURL
option and executes any content scripts that have been supplied to it in the contentScript
and contentScriptFile
options.
The page is not displayed to the user.
The page worker is loaded as soon as the Page
object is created and stays loaded until its destroy
method is called or the add-on is unloaded.
Methods
destroy()
Unloads the page worker. After you destroy a page worker, its memory is freed and you must create a new instance if you need to load another page.
postMessage(message)
Sends a message to the content scripts.
Parameters
message : value
The message to send. Must be JSON-able.
on(type, listener)
Registers an event listener with the page worker. See Working with Events for help with events.
Parameters
type : string
The type of event to listen for.
listener : function
The listener function that handles the event.
removeListener(type, listener)
Unregisters an event listener from the page worker.
Parameters
type : string
The type of event for which listener
was registered.
listener : function
The listener function that was registered.
Properties
port
Object that allows you to:
- send events to the content script using the
port.emit
function - receive events from the content script using the
port.on
function
See the guide to communicating using port
for details.
contentURL
The URL of content to load. This can point to local content loaded from your add-on's "data" directory or remote content. Setting it loads the content immediately.
allow
A object describing permissions for the content. It contains a single key named script
whose value is a boolean that indicates whether or not to execute script in the content. script
defaults to true.
include
A set of match patterns to define the urls which the page-worker's content script will be applied. This is useful when using pages which redirect to other pages in your page-worker.
contentScriptFile
A local file URL or an array of local file URLs of content scripts to load.
contentScript
A string or an array of strings containing the texts of content scripts to load.
contentScriptWhen
When to load the content scripts. This may have one of the following values:
- "start": load content scripts immediately after the document element for the page is inserted into the DOM, but before the DOM content itself has been loaded
- "ready": load content scripts once DOM content has been loaded, corresponding to the DOMContentLoaded event
- "end": load content scripts once all the content (DOM, JS, CSS, images) for the page has been loaded, at the time the window.onload event fires
contentScriptOptions
Read-only value exposed to content scripts under self.options
property.
Any kind of jsonable value (object, array, string, etc.) can be used here. Optional.
Events
message
If you listen to this event you can receive message events from content scripts associated with this page worker. When a content script posts a message using self.postMessage()
, the message is delivered to the add-on code in the page worker's message
event.
Arguments
value : Listeners are passed a single argument which is the message posted from the content script. The message can be any JSON-serializable value
error
This event is emitted when an uncaught runtime error occurs in one of the page worker's content scripts.
Arguments
Error : Listeners are passed a single argument, the Error object.