Editor’s Draft, 3 December 2025
More details about this documentCopyright © 2025 World Wide Web Consortium. W3C® liability, trademark and permissive document license rules apply.
This specification provides an API for representing file objects in web applications, as well as programmatically selecting them and accessing their data. This includes:
A FileList interface, which represents an array of individually selected files from the underlying system. The user interface for selection can be invoked via <input type="file">, i.e. when the input element is in the File Upload state [HTML].
A Blob interface, which represents immutable raw binary data, and allows access to ranges of bytes within the Blob object as a separate Blob.
A File interface, which includes readonly informational attributes about a file such as its name and the date of the last modification (on disk) of the file.
A FileReader interface, which provides methods to read a File or a Blob, and an event model to obtain the results of these reads.
A URL scheme for use with binary data such as files, so that they can be referenced within web applications.
Additionally, this specification defines objects to be used within threaded web applications for the synchronous reading of files.
§ 10 Requirements and Use Cases covers the motivation behind this specification.
This API is designed to be used in conjunction with other APIs and elements on the web platform, notably: XMLHttpRequest (e.g. with an overloaded send() method for File or Blob arguments), postMessage(), DataTransfer (part of the drag and drop API defined in [HTML]) and Web Workers. Additionally, it should be possible to programmatically obtain a list of files from the input element when it is in the File Upload state [HTML]. These kinds of behaviors are defined in the appropriate affiliated specifications.
This document was published by the Web Applications Working Group as an Editor’s Draft.
Publication as an Editor’s Draft does not imply endorsement by W3C and its Members.
This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than a work in progress.
Previous discussion of this specification has taken place on two other mailing lists: public-webapps@w3.org (archive) and public-webapi@w3.org (archive). Ongoing discussion will be on the public-webapps@w3.org mailing list.
This draft consists of changes made to the previous Last Call Working Draft. Please send comments to the public-webapi@w3.org as described above. You can see Last Call Feedback on the W3C Wiki: https://www.w3.org/wiki/Webapps/LCWD-FileAPI-20130912
An implementation report is automatically generated from the test suite.
This document was produced by a group operating under the W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent that the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This document is governed by the 18 August 2025 W3C Process Document.
This section is informative.
Web applications should have the ability to manipulate as wide as possible a range of user input, including files that a user may wish to upload to a remote server or manipulate inside a rich web application. This specification defines the basic representations for files, lists of files, errors raised by access to files, and programmatic ways to read files. Additionally, this specification also defines an interface that represents "raw data" which can be asynchronously processed on the main thread of conforming user agents. The interfaces and API defined in this specification can be used with other interfaces and APIs exposed to the web platform.
The File interface represents file data typically obtained from the underlying file system, and the Blob interface ("Binary Large Object" - a name originally introduced to web APIs in Google Gears) represents immutable raw data. File or Blob reads should happen asynchronously on the main thread, with an optional synchronous API used within threaded web applications. An asynchronous API for reading files prevents blocking and UI "freezing" on a user agent’s main thread. This specification defines an asynchronous API based on an event model to read and access a File or Blob’s data. A FileReader object provides asynchronous read methods to access that file’s data through event handler content attributes and the firing of events. The use of events and event handlers allows separate code blocks the ability to monitor the progress of the read (which is particularly useful for remote drives or mounted drives, where file access performance may vary from local drives) and error conditions that may arise during reading of a file. An example will be illustrative.
When this specification says to terminate an algorithm the user agent must terminate the algorithm after finishing the step it is on. Asynchronous read methods defined in this specification may return before the algorithm in question is terminated, and can be terminated by an abort() call.
The algorithms and steps in this specification use the following mathematical operations:
max(a,b) returns the maximum of a and b, and is always performed on integers as they are defined in WebIDL [WebIDL]; in the case of max(6,4) the result is 6. This operation is also defined in ECMAScript [ECMA-262].
min(a,b) returns the minimum of a and b, and is always performed on integers as they are defined in WebIDL [WebIDL]; in the case of min(6,4) the result is 4. This operation is also defined in ECMAScript [ECMA-262].
Mathematical comparisons such as < (less than), ≤ (less than or equal to), and > (greater than) are as in ECMAScript [ECMA-262].
The term Unix Epoch is used in this specification to refer to the time 00:00:00 UTC on January 1 1970 (or 1970-01-01T00:00:00Z ISO 8601); this is the same time that is conceptually "0" in ECMA-262 [ECMA-262].
Let originalSize be blob’s size.
The start parameter, if non-null, is a value for the start point of a slice blob call, and must be treated as a byte-order position, with the zeroth position representing the first byte. User agents must normalize start according to the following:
The end parameter, if non-null. is a value for the end point of a slice blob call. User agents must normalize end according to the following:
The contentType parameter, if non-null, is used to set the ASCII-encoded string in lower case representing the media type of the Blob. User agents must normalize contentType according to the following:
If relativeContentType contains any characters outside the range of U+0020 to U+007E, then set relativeContentType to the empty string and return from these substeps.
Convert every character in relativeContentType to ASCII lowercase.
Let span be max((relativeEnd - relativeStart), 0).
Return a new Blob object S with the following characteristics:
A Blob object refers to a byte sequence, and has a size attribute which is the total number of bytes in the byte sequence, and a type attribute, which is an ASCII-encoded string in lower case representing the media type of the byte sequence.
Each Blob must have an internal snapshot state, which must be initially set to the state of the underlying storage, if any such underlying storage exists. Further normative definition of snapshot state can be found for Files.
[Exposed=(Window,Worker), Serializable] interface Blob { constructor(optional sequence<BlobPart> blobParts, optional BlobPropertyBag options = {}); readonly attribute unsigned long long size; readonly attribute DOMString type; // slice Blob into byte-ranged chunks Blob slice(optional [Clamp] long long start, optional [Clamp] long long end, optional DOMString contentType); // read from the Blob. [NewObject] ReadableStream stream(); [NewObject] Promise<USVString> text(); [NewObject] Promise<ArrayBuffer> arrayBuffer(); [NewObject] Promise<Uint8Array> bytes(); }; enum EndingType { "transparent", "native" }; dictionary BlobPropertyBag { DOMString type = ""; EndingType endings = "transparent"; }; typedef (BufferSource or Blob or USVString) BlobPart;Blob objects are serializable objects. Their serialization steps, given value and serialized, are:
Set serialized.[[SnapshotState]] to value’s snapshot state.
Set serialized.[[ByteSequence]] to value’s underlying byte sequence.
Their deserialization step, given serialized and value, are:
Set value’s snapshot state to serialized.[[SnapshotState]].
Set value’s underlying byte sequence to serialized.[[ByteSequence]].
Let stream be a new ReadableStream created in blob’s relevant Realm.
Set up stream with byte reading support.
Run the following steps in parallel:
While not all bytes of blob have been read:
Let bytes be the byte sequence that results from reading a chunk from blob, or failure if a chunk cannot be read.
Queue a global task on the file reading task source given blob’s relevant global object to perform the following steps:
If bytes is failure, then error stream with a failure reason and abort these steps.
Let chunk be a new Uint8Array wrapping an ArrayBuffer containing bytes. If creating the ArrayBuffer throws an exception, then error stream with that exception and abort these steps.
Enqueue chunk in stream.
We need to specify more concretely what reading from a Blob actually does, what possible errors can happen, perhaps something about chunk sizes, etc.
Return stream.
If invoked with zero parameters, return a new Blob object consisting of 0 bytes, with size set to 0, and with type set to the empty string.
Let bytes be the result of processing blob parts given blobParts and options.
If the type member of the options argument is not the empty string, run the following sub-steps:
Let t be the type dictionary member. If t contains any characters outside the range U+0020 to U+007E, then set t to the empty string and return from these substeps.
Convert every character in t to ASCII lowercase.
Return a Blob object referring to bytes as its associated byte sequence, with its size set to the length of bytes, and its type set to the value of t from the substeps above.
The Blob() constructor can be invoked with the parameters below:
A blobParts sequence which takes any number of the following types of elements, and in any order:BufferSource elements.
Blob elements.
USVString elements.
type, the ASCII-encoded string in lower case representing the media type of the Blob. Normative conditions for this member are provided in the § 3.1 Constructors.
endings, an enum which can take the values "transparent" or "native". By default this is set to "transparent". If set to "native", line endings will be converted to native in any USVString elements in blobParts.
Let bytes be an empty sequence of bytes.
For each element in parts:
If element is a USVString, run the following substeps:
Let s be element.
If the endings member of options is "native", set s to the result of converting line endings to native of element.
Append the result of UTF-8 encoding s to bytes.
Note: The algorithm from WebIDL [WebIDL] replaces unmatched surrogates in an invalid utf-16 string with U+FFFD replacement characters. Scenarios exist when the Blob constructor may result in some data loss due to lost or scrambled character sequences.
If element is a BufferSource, get a copy of the bytes held by the buffer source, and append those bytes to bytes.
If element is a Blob, append the bytes it represents to bytes.
Note: The type of the Blob array element is ignored and will not affect type of returned Blob object.
Return bytes.
Let native line ending be the code point U+000A LF.
If the underlying platform’s conventions are to represent newlines as a carriage return and line feed sequence, set native line ending to the code point U+000D CR followed by the code point U+000A LF.
Set result to the empty string.
Let position be a position variable for s, initially pointing at the start of s.
Let token be the result of collecting a sequence of code points that are not equal to U+000A LF or U+000D CR from s given position.
Append token to result.
While position is not past the end of s:
If the code point at position within s equals U+000D CR:
Append native line ending to result.
Advance position by 1.
If position is not past the end of s and the code point at position within s equals U+000A LF advance position by 1.
Otherwise if the code point at position within s equals U+000A LF, advance position by 1 and append native line ending to result.
Let token be the result of collecting a sequence of code points that are not equal to U+000A LF or U+000D CR from s given position.
Append token to result.
Return result.
The type attribute can be set by the web application itself through constructor invocation and through the slice() call; in these cases, further normative conditions for this attribute are in § 3.1 Constructors, § 4.1 Constructor, and § 3.3.1 The slice() method respectively. User agents can also determine the type of a Blob, especially if the byte sequence is from an on-disk file; in this case, further normative conditions are in the file type guidelines.
Note: The type t of a Blob is considered a parsable MIME type, if performing the parse a MIME type algorithm to a byte sequence converted from the ASCII-encoded string representing the Blob object’s type does not return failure.
Note: Use of the type attribute informs the package data algorithm and determines the Content-Type header when fetching blob URLs.
Let sliceStart, sliceEnd, and sliceContentType be null.
If start is given, set sliceStart to start.
If end is given, set sliceEnd to end.
If contentType is given, set sliceContentType to contentType.
Return the result of slice blob given this, sliceStart, sliceEnd, and sliceContentType.
The stream() method, when invoked, must return the result of calling get stream on this.
The text() method, when invoked, must run these steps:
Let stream be the result of calling get stream on this.
Let reader be the result of getting a reader from stream. If that threw an exception, return a new promise rejected with that exception.
Let promise be the result of reading all bytes from stream with reader.
Return the result of transforming promise by a fulfillment handler that returns the result of running UTF-8 decode on its first argument.
Note: This is different from the behavior of readAsText() to align better with the behavior of Fetch’s text(). Specifically this method will always use UTF-8 as encoding, while FileReader can use a different encoding depending on the blob’s type and passed in encoding name.
The arrayBuffer() method, when invoked, must run these steps:
Let stream be the result of calling get stream on this.
Let reader be the result of getting a reader from stream. If that threw an exception, return a new promise rejected with that exception.
Let promise be the result of reading all bytes from stream with reader.
Return the result of transforming promise by a fulfillment handler that returns a new ArrayBuffer whose contents are its first argument.
The bytes() method, when invoked, must run these steps:
Let stream be the result of calling get stream on this.
Let reader be the result of getting a reader from stream. If that threw an exception, return a new promise rejected with that exception.
Let promise be the result of reading all bytes from stream with reader.
Return the result of transforming promise by a fulfillment handler that returns a new Uint8Array wrapping an ArrayBuffer containing its first argument.
A File object is a Blob object with a name attribute, which is a string; it can be created within the web application via a constructor, or is a reference to a byte sequence from a file from the underlying (OS) file system.
If a File object is a reference to a byte sequence originating from a file on disk, then its snapshot state should be set to the state of the file on disk at the time the File object is created.
Note: This is a non-trivial requirement to implement for user agents, and is thus not a must but a should [RFC2119]. User agents should endeavor to have a File object’s snapshot state set to the state of the underlying storage on disk at the time the reference is taken. If the file is modified on disk following the time a reference has been taken, the File’s snapshot state will differ from the state of the underlying storage. User agents may use modification time stamps and other mechanisms to maintain snapshot state, but this is left as an implementation detail.
When a File object refers to a file on disk, user agents must return the type of that file, and must follow the file type guidelines below:
User agents must return the type as an ASCII-encoded string in lower case, such that when it is converted to a corresponding byte sequence, it is a parsable MIME type, or the empty string – 0 bytes – if the type cannot be determined.
When the file is of type text/plain user agents must NOT append a charset parameter to the dictionary of parameters portion of the media type [MIMESNIFF].
User agents must not attempt heuristic determination of encoding, including statistical methods.
File objects are serializable objects. Their serialization steps, given value and serialized, are:
Set serialized.[[SnapshotState]] to value’s snapshot state.
Set serialized.[[ByteSequence]] to value’s underlying byte sequence.
Set serialized.[[Name]] to the value of value’s name attribute.
Set serialized.[[LastModified]] to the value of value’s lastModified attribute.
Their deserialization steps, given value and serialized, are:
Set value’s snapshot state to serialized.[[SnapshotState]].
Set value’s underlying byte sequence to serialized.[[ByteSequence]].
Initialize the value of value’s name attribute to serialized.[[Name]].
Initialize the value of value’s lastModified attribute to serialized.[[LastModified]].
Let bytes be the result of processing blob parts given fileBits and options.
Let n be the fileName argument to the constructor.
Note: Underlying OS filesystems use differing conventions for file name; with constructed files, mandating UTF-16 lessens ambiquity when file names are converted to byte sequences.
Process FilePropertyBag dictionary argument by running the following substeps:
If the type member is provided and is not the empty string, let t be set to the type dictionary member. If t contains any characters outside the range U+0020 to U+007E, then set t to the empty string and return from these substeps.
Convert every character in t to ASCII lowercase.
If the lastModified member is provided, let d be set to the lastModified dictionary member. If it is not provided, set d to the current date and time represented as the number of milliseconds since the Unix Epoch (which is the equivalent of Date.now() [ECMA-262]).
Note: Since ECMA-262 Date objects convert to long long values representing the number of milliseconds since the Unix Epoch, the lastModified member could be a Date object [ECMA-262].
Return a new File object F such that:
F refers to the bytes byte sequence.
F.size is set to the number of total bytes in bytes.
F.name is set to n.
F.type is set to t.
F.lastModified is set to d.
The File() constructor can be invoked with the parameters below:
A fileBits sequence which takes any number of the following elements, and in any order:BufferSource elements.
USVString elements.
An optional lastModified member, which must be a long long; normative conditions for this member are provided in § 4.1 Constructor.
The File interface is available on objects that expose an attribute of type FileList; these objects are defined in HTML [HTML]. The File interface, which inherits from Blob, is immutable, and thus represents file data that can be read into memory at the time a read operation is initiated. User agents must process reads on files that no longer exist at the time of read as errors, throwing a NotFoundError exception if using a FileReaderSync on a Web Worker [Workers] or firing an error event with the error attribute returning a NotFoundError.
Note: The FileList interface should be considered "at risk" since the general trend on the Web Platform is to replace such interfaces with the Array platform object in ECMAScript [ECMA-262]. In particular, this means syntax of the sort filelist.item(0) is at risk; most other programmatic use of FileList is unlikely to be affected by the eventual migration to an Array type.
This interface is a list of File objects.
[Exposed=(Window,Worker), Serializable] interface FileList { getter File? item(unsigned long index); readonly attribute unsigned long length; };FileList objects are serializable objects. Their serialization steps, given value and serialized, are:
Set serialized.[[Files]] to an empty list.
For each file in value, append the sub-serialization of file to serialized.[[Files]].
Their deserialization step, given serialized and value, are:
For each file of serialized.[[Files]], add the sub-deserialization of file to value.
index must be treated by user agents as value for the position of a File object in the FileList, with 0 representing the first file. Supported property indices are the numbers in the range zero to one less than the number of File objects represented by the FileList object. If there are no such File objects, then there are no supported property indices.
Note: The HTMLInputElement interface has a readonly attribute of type FileList, which is what is being accessed in the above example. Other interfaces with a readonly attribute of type FileList include the DataTransfer interface.
This specification defines a new generic task source called the file reading task source, which is used for all tasks that are queued in this specification to read byte sequences associated with Blob and File objects. It is to be used for features that trigger in response to asynchronously reading binary data.
A FileReader has an associated state, that is "empty", "loading", or "done". It is initially "empty".
A FileReader has an associated result (null, a DOMString or an ArrayBuffer). It is initially null.
A FileReader has an associated error (null or a DOMException). It is initially null.
The FileReader() constructor, when invoked, must return a new FileReader object.
The readyState attribute’s getter, when invoked, switches on this’s state and runs the associated step:
"empty"Return EMPTY
"loading"Return LOADING
"done"Return DONE
The result attribute’s getter, when invoked, must return this’s result.
The error attribute’s getter, when invoked, must return this’s error.
If fr’s state is "loading", throw an InvalidStateError DOMException.
Set fr’s state to "loading".
Set fr’s result to null.
Set fr’s error to null.
Let stream be the result of calling get stream on blob.
Let reader be the result of getting a reader from stream.
Let bytes be an empty byte sequence.
Let chunkPromise be the result of reading a chunk from stream with reader.
Let isFirstChunk be true.
In parallel, while true:
Wait for chunkPromise to be fulfilled or rejected.
If chunkPromise is fulfilled, and isFirstChunk is true, queue a task to fire a progress event called loadstart at fr.
We might change loadstart to be dispatched synchronously, to align with XMLHttpRequest behavior. [Issue #119]
Set isFirstChunk to false.
If chunkPromise is fulfilled with an object whose done property is false and whose value property is a Uint8Array object, run these steps:
Let bs be the byte sequence represented by the Uint8Array object.
Append bs to bytes.
If roughly 50ms have passed since these steps were last invoked, queue a task to fire a progress event called progress at fr.
Set chunkPromise to the result of reading a chunk from stream with reader.
Otherwise, if chunkPromise is fulfilled with an object whose done property is true, queue a task to run the following steps and abort this algorithm:
Set fr’s state to "done".
Let result be the result of package data given bytes, type, blob’s type, and encodingName.
If package data threw an exception error:
Set fr’s error to error.
Fire a progress event called error at fr.
Else:
Set fr’s result to result.
Fire a progress event called load at the fr.
If fr’s state is not "loading", fire a progress event called loadend at the fr.
Note: Event handler for the load or error events could have started another load, if that happens the loadend event for this load is not fired.
Otherwise, if chunkPromise is rejected with an error error, queue a task to run the following steps and abort this algorithm:
Set fr’s state to "done".
Set fr’s error to error.
Fire a progress event called error at fr.
If fr’s state is not "loading", fire a progress event called loadend at fr.
Note: Event handler for the error event could have started another load, if that happens the loadend event for this load is not fired.
Use the file reading task source for all these tasks.
The following are the event handler content attributes (and their corresponding event handler event types) that user agents must support on FileReader as DOM attributes:
| onloadstart | loadstart |
| onprogress | progress |
| onabort | abort |
| onerror | error |
| onload | load |
| onloadend | loadend |
The FileReader object has been constructed, and there are no pending reads. None of the read methods have been called. This is the default state of a newly minted FileReader object, until one of the read methods have been called on it.
LOADING (numeric value 1)A File or Blob is being read. One of the read methods is being processed, and no error has occurred during the read.
DONE (numeric value 2)The entire File or Blob has been read into memory, OR a file read error occurred, OR the read was aborted using abort(). The FileReader is no longer reading a File or Blob. If readyState is set to DONE it means at least one of the read methods have been called on this FileReader.
The FileReader interface makes available several asynchronous read methods—readAsArrayBuffer(), readAsBinaryString(), readAsText() and readAsDataURL(), which read files into memory.
Note: If multiple concurrent read methods are called on the same FileReader object, user agents throw an InvalidStateError on any of the read methods that occur when readyState = LOADING.
(FileReaderSync makes available several synchronous read methods. Collectively, the sync and async read methods of FileReader and FileReaderSync are referred to as just read methods.)
6.2.3.1. The readAsDataURL() methodThe readAsDataURL(blob) method, when invoked, must initiate a read operation for blob with DataURL.
6.2.3.2. The readAsText() methodThe readAsText(blob, encoding) method, when invoked, must initiate a read operation for blob with Text and encoding.
6.2.3.3. The readAsArrayBuffer()The readAsArrayBuffer(blob) method, when invoked, must initiate a read operation for blob with ArrayBuffer.
6.2.3.4. The readAsBinaryString() methodThe readAsBinaryString(blob) method, when invoked, must initiate a read operation for blob with BinaryString.
Note: The use of readAsArrayBuffer() is preferred over readAsBinaryString(), which is provided for backwards compatibility.
6.2.3.5. The abort() methodWhen the abort() method is called, the user agent must run the steps below:
If this’s state is "empty" or if this’s state is "done" set this’s result to null and terminate this algorithm.
If this’s state is "loading" set this’s state to "done" and set this’s result to null.
If there are any tasks from this on the file reading task source in an affiliated task queue, then remove those tasks from that task queue.
Terminate the algorithm for the read method being processed.
Fire a progress event called abort at this.
If this’s state is not "loading", fire a progress event called loadend at this.
Return bytes as a DataURL [RFC2397] subject to the considerations below:
Use mimeType as part of the Data URL if it is available in keeping with the Data URL specification [RFC2397].
If mimeType is not available return a Data URL without a media-type. [RFC2397].
Better specify how the DataURL is generated. [Issue #104]
TextLet encoding be failure.
If the encodingName is present, set encoding to the result of getting an encoding from encodingName.
If encoding is failure, and mimeType is present:
Let type be the result of parse a MIME type given mimeType.
If type is not failure, set encoding to the result of getting an encoding from type’s parameters["charset"].
If encoding is failure, then set encoding to UTF-8.
Decode bytes using fallback encoding encoding, and return the result.
Return a new ArrayBuffer whose contents are bytes.
BinaryStringReturn bytes as a binary string, in which every byte is represented by a code unit of equal value [0..255].
The FileReader object must be the event target for all events in this specification.
When this specification says to fire a progress event called e (for some ProgressEvent e at a given FileReader reader), the following are normative:
The progress event e does not bubble. e.bubbles must be false [DOM]
The progress event e is NOT cancelable. e.cancelable must be false [DOM]
The following are the events that are fired at FileReader objects.
| loadstart | ProgressEvent | When the read starts. |
| progress | ProgressEvent | While reading (and decoding) blob |
| abort | ProgressEvent | When the read has been aborted. For instance, by invoking the abort() method. |
| error | ProgressEvent | When the read has failed (see file read errors). |
| load | ProgressEvent | When the read has successfully completed. |
| loadend | ProgressEvent | When the request has completed (either in success or failure). |
This section is informative.
The following are invariants applicable to event firing for a given asynchronous read method in this specification:
Once a loadstart has been fired, a corresponding loadend fires at completion of the read, UNLESS any of the following are true:
the read method has been cancelled using abort() and a new read method has been invoked
the event handler function for a load event initiates a new read
the event handler function for a error event initiates a new read.
Note: The events loadstart and loadend are not coupled in a one-to-one manner.
One progress event will fire when blob has been completely read into memory.
No progress event fires after any one of abort, load, and error have fired. At most one of abort, load, and error fire for a given read.
Web Workers allow for the use of synchronous File or Blob read APIs, since such reads on threads do not block the main thread. This section defines a synchronous API, which can be used within Workers [Workers]. Workers can avail of both the asynchronous API (the FileReader object) and the synchronous API (the FileReaderSync object).
This interface provides methods to synchronously read File or Blob objects into memory.
[Exposed=(DedicatedWorker,SharedWorker)] interface FileReaderSync { constructor(); // Synchronously return strings ArrayBuffer readAsArrayBuffer(Blob blob); DOMString readAsBinaryString(Blob blob); DOMString readAsText(Blob blob, optional DOMString encoding); DOMString readAsDataURL(Blob blob); }; 6.5.1.1. ConstructorsWhen the FileReaderSync() constructor is invoked, the user agent must return a new FileReaderSync object.
6.5.1.2. The readAsText()The readAsText(blob, encoding) method, when invoked, must run these steps:
Let stream be the result of calling get stream on blob.
Let reader be the result of getting a reader from stream.
Let promise be the result of reading all bytes from stream with reader.
Wait for promise to be fulfilled or rejected.
If promise fulfilled with a byte sequence bytes:
Return the result of package data given bytes, Text, blob’s type, and encoding.
Throw promise’s rejection reason.
The readAsDataURL(blob) method, when invoked, must run these steps:
Let stream be the result of calling get stream on blob.
Let reader be the result of getting a reader from stream.
Let promise be the result of reading all bytes from stream with reader.
Wait for promise to be fulfilled or rejected.
If promise fulfilled with a byte sequence bytes:
Return the result of package data given bytes, DataURL, and blob’s type.
Throw promise’s rejection reason.
The readAsArrayBuffer(blob) method, when invoked, must run these steps:
Let stream be the result of calling get stream on blob.
Let reader be the result of getting a reader from stream.
Let promise be the result of reading all bytes from stream with reader.
Wait for promise to be fulfilled or rejected.
If promise fulfilled with a byte sequence bytes:
Return the result of package data given bytes, ArrayBuffer, and blob’s type.
Throw promise’s rejection reason.
The readAsBinaryString(blob) method, when invoked, must run these steps:
Let stream be the result of calling get stream on blob.
Let reader be the result of getting a reader from stream.
Let promise be the result of reading all bytes from stream with reader.
Wait for promise to be fulfilled or rejected.
If promise fulfilled with a byte sequence bytes:
Return the result of package data given bytes, BinaryString, and blob’s type.
Throw promise’s rejection reason.
Note: The use of readAsArrayBuffer() is preferred over readAsBinaryString(), which is provided for backwards compatibility.
File read errors can occur when reading files from the underlying filesystem. The list below of potential error conditions is informative.
The File or Blob being accessed may not exist at the time one of the asynchronous read methods or synchronous read methods are called. This may be due to it having been moved or deleted after a reference to it was acquired (e.g. concurrent modification with another application). See NotFoundError.
A File or Blob may be unreadable. This may be due to permission problems that occur after a reference to a File or Blob has been acquired (e.g. concurrent lock with another application). Additionally, the snapshot state may have changed. See NotReadableError.
User agents MAY determine that some files are unsafe for use within Web applications. A file may change on disk since the original file selection, thus resulting in an invalid read. Additionally, some file and directory structures may be considered restricted by the underlying filesystem; attempts to read from them may be considered a security violation. See § 9 Security and Privacy Considerations and SecurityError.
This section is normative.
Error conditions can arise when reading a File or a Blob.
The read operation can terminate due to error conditions when reading a File or a Blob; the particular error condition that causes the get stream algorithm to fail is called a failure reason. A failure reason is one of NotFound, UnsafeFile, TooManyReads, SnapshotState, or FileLock.
Synchronous read methods throw exceptions of the type in the table below if there has been an error owing to a particular failure reason.
Asynchronous read methods use the error attribute of the FileReader object, which must return a DOMException object of the most appropriate type from the table below if there has been an error owing to a particular failure reason, or otherwise return null.
| NotFoundError |
If the File or Blob resource could not be found at the time the read was processed,
this is the NotFound failure reason.
For asynchronous read methods the error attribute must return a NotFoundError exception and synchronous read methods must throw a NotFoundError exception. |
| SecurityError |
If:
For asynchronous read methods the error attribute may return a SecurityError exception and synchronous read methods may throw a SecurityError exception. This is a security error to be used in situations not covered by any other failure reason. |
| NotReadableError |
If:
For asynchronous read methods the error attribute must return a NotReadableError exception and synchronous read methods must throw a NotReadableError exception. |
This section defines a scheme for a URL used to refer to Blob and MediaSource objects.
This section is informative.
Blob (or object) URLs are URLs like blob:http://example.com/550e8400-e29b-41d4-a716-446655440000. This enables integration of Blobs and MediaSources with other APIs that are only designed to be used with URLs, such as the img element. Blob URLs can also be used to navigate to as well as to trigger downloads of locally generated data.
For this purpose two static methods are exposed on the URL interface, createObjectURL(obj) and revokeObjectURL(url). The first method creates a mapping from a URL to a Blob, and the second method revokes said mapping. As long as the mapping exist the Blob can’t be garbage collected, so some care must be taken to revoke the URL as soon as the reference is no longer needed. All URLs are revoked when the global that created the URL itself goes away.
Each user agent must maintain a blob URL store. A blob URL store is a map where keys are valid URL strings and values are blob URL Entries.
A blob URL entry consists of an object (of type Blob or MediaSource), and an environment (an environment settings object).
Note: Specifications have to use the obtain a blob object algorithm to access a blob URL entry’s object.
Keys in the blob URL store (also known as blob URLs) are valid URL strings that when parsed result in a URL with a scheme equal to "blob", an empty host, and a path consisting of one element itself also a valid URL string.
Let isAuthorized be true.
If environment is an environment settings object, then set isAuthorized to the result of checking for same-partition blob URL usage with blobUrlEntry and environment.
If isAuthorized is false, then return failure.
Return blobUrlEntry’s object.
Let result be the empty string.
Append the string "blob:" to result.
Let settings be the current settings object
Let origin be settings’s origin.
Let serialized be the ASCII serialization of origin.
If serialized is "null", set it to an implementation-defined value.
Append serialized to result.
Append U+0024 SOLIDUS (/) to result.
Generate a UUID [RFC4122] as a string and append it to result.
Return result.
Let store be the user agent’s blob URL store.
Let url be the result of generating a new blob URL.
Let entry be a new blob URL entry consisting of object and the current settings object.
Set store[url] to entry.
Return url.
Let store be the user agent’s blob URL store;
Let url string be the result of serializing url.
Remove store[url string].
Let store be the user agent’s blob URL store.
Let url string be the result of serializing url with the exclude fragment flag set.
If store[url string] exists, return store[url string]; otherwise return failure.
Futher requirements for the parsing and fetching model for blob URLs are defined in the [URL] and [Fetch] specifications.
This section is informative.
The origin of a blob URL is always the same as that of the environment that created the URL, as long as the URL hasn’t been revoked yet. This is achieved by the [URL] spec looking up the URL in the blob URL store when parsing a URL, and using that entry to return the correct origin.
If the URL was revoked the serialization of the origin will still remain the same as the serialization of the origin of the environment that created the blob URL, but for opaque origins the origin itself might be distinct. This difference isn’t observable though, since a revoked blob URL can’t be resolved/fetched anymore anyway.
Blob URLs can only be fetched from environments where the storage key matches that of the environment where the blob URL was created. Blob URL navigations are not subject to this restriction.
Let blobStorageKey be the result of obtaining a storage key for non-storage purposes with blobUrlEntry’s environment.
Let environmentStorageKey be the result of obtaining a storage key for non-storage purposes with environment.
If blobStorageKey is not equal to environmentStorageKey, then return false.
Return true.
This specification extends the unloading document cleanup steps with the following steps:
Let environment be the Document’s relevant settings object.
Let store be the user agent’s blob URL store;
Remove from store any entries for which the value’s environment is equal to environment.
This needs a similar hook when a worker is unloaded.
Blob URLs are created and revoked using static methods exposed on the URL object. Revocation of a blob URL decouples the blob URL from the resource it refers to, and if it is dereferenced after it is revoked, user agents must act as if a network error has occurred. This section describes a supplemental interface to the URL specification [URL] and presents methods for blob URL creation and revocation.
[Exposed=(Window,DedicatedWorker,SharedWorker)] partial interface URL { static DOMString createObjectURL((Blob or MediaSource) obj); static undefined revokeObjectURL(DOMString url); };Let urlRecord be the result of parsing url.
If urlRecord’s scheme is not "blob", return.
Let entry be urlRecord’s blob URL entry.
If entry is null, then return.
Let isAuthorized be the result of checking for same-partition blob URL usage with entry and the current settings object.
If isAuthorized is false, then return.
Note: This means that rather than throwing some kind of error, attempting to revoke a URL that isn’t registered or that was registered from an environment in a different storage partition will silently fail. User agents might display a message on the error console if this happens.
Note: Attempts to dereference url after it has been revoked will result in a network error. Requests that were started before the url was revoked should still succeed.
Since a user agent has one global blob URL store, it is possible to revoke an object URL from a different window than from which it was created. The URL.revokeObjectURL() call ensures that subsequent dereferencing of myurl results in a the user agent acting as if a network error has occurred.
Blob URLs are strings that are used to fetch Blob objects, and can persist for as long as the document from which they were minted using URL.createObjectURL()—see § 8.3.3 Lifetime of blob URLs.
This section gives sample usage of creation and revocation of blob URLs with explanations.
The example above allows multiple references to a single blob URL, and the web developer then revokes the blob URL string after both image objects have been loaded. While not restricting number of uses of the blob URL offers more flexibility, it increases the likelihood of leaks; developers should pair it with a corresponding call to URL.revokeObjectURL().
This section is informative.
This specification allows web content to read files from the underlying file system, as well as provides a means for files to be accessed by unique identifiers, and as such is subject to some security considerations. This specification also assumes that the primary user interaction is with the <input type="file"/> element of HTML forms [HTML], and that all files that are being read by FileReader objects have first been selected by the user. Important security considerations include preventing malicious file selection attacks (selection looping), preventing access to system-sensitive files, and guarding against modifications of files on disk after a selection has taken place.
Preventing selection loopingDuring file selection, a user may be bombarded with the file picker associated with <input type="file"/> (in a "must choose" loop that forces selection before the file picker is dismissed) and a user agent may prevent file access to any selections by making the FileList object returned be of size 0.
System-sensitive files(e.g. files in /usr/bin, password files, and other native operating system executables) typically should not be exposed to web content, and should not be accessed via blob URLs. User agents may throw a SecurityError exception for synchronous read methods, or return a SecurityError exception for asynchronous reads.
This section is provisional; more security data may supplement this in subsequent drafts.
This section covers what the requirements are for this API, as well as illustrates some use cases. This version of the API does not satisfy all use cases; subsequent versions may elect to address these.
Once a user has given permission, user agents should provide the ability to read and parse data directly from a local file programmatically.
Data should be able to be stored locally so that it is available for later use, which is useful for offline data access for web applications.
User agents should provide the ability to save a local file programmatically given an amount of data and a file name.
Note: While this specification doesn’t provide an explicit API call to trigger downloads, the HTML5 specification has addressed this. The download attribute of the a element initiates a download, saving a File with the name specified. The combination of this API and the download attribute on a elements allows for the creation of files within web applications, and the ability to save them locally.
User agents should provide a streamlined programmatic ability to send data from a file to a remote server that works more efficiently than form-based uploads today.
User agents should provide an API exposed to script that exposes the features above. The user is notified by UI anytime interaction with the file system takes place, giving the user full ability to cancel or abort the transaction. The user is notified of any file selections, and can cancel these. No invocations to these APIs occur silently without user intervention.
This specification was originally developed by the SVG Working Group. Many thanks to Mark Baker and Anne van Kesteren for their feedback.
Thanks to Robin Berjon, Jonas Sicking and Vsevolod Shmyroff for editing the original specification.
Special thanks to Olli Pettay, Nikunj Mehta, Garrett Smith, Aaron Boodman, Michael Nordman, Jian Li, Dmitry Titov, Ian Hickson, Darin Fisher, Sam Weinig, Adrian Bateman and Julian Reschke.
Thanks to the W3C WebApps WG, and to participants on the public-webapps@w3.org listserv
Conformance requirements are expressed with a combination of descriptive assertions and RFC 2119 terminology. The key words “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in the normative parts of this document are to be interpreted as described in RFC 2119. However, for readability, these words do not appear in all uppercase letters in this specification.
All of the text of this specification is normative except sections explicitly marked as non-normative, examples, and notes. [RFC2119]
Examples in this specification are introduced with the words “for example” or are set apart from the normative text with class="example", like this:
Informative notes begin with the word “Note” and are set apart from the normative text with class="note", like this:
Note, this is an informative note.
Requirements phrased in the imperative as part of algorithms (such as "strip any leading space characters" or "return false and abort these steps") are to be interpreted with the meaning of the key word ("must", "should", "may", etc) used in introducing the algorithm.
Conformance requirements phrased as algorithms or specific steps can be implemented in any manner, so long as the end result is equivalent. In particular, the algorithms defined in this specification are intended to be easy to understand and are not intended to be performant. Implementers are encouraged to optimize.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
FileReaderSync/readAsArrayBuffer
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.
In all current engines.