[Federal Register Volume 86, Number 126 (Tuesday, July 6, 2021)] [Proposed Rules] [Pages 35429-35443] From the Federal Register Online via the Government Publishing Office [www.gpo.gov] [FR Doc No: 2021-14325] ======================================================================= ----------------------------------------------------------------------- DEPARTMENT OF COMMERCE Patent and Trademark Office 37 CFR Part 1 [Docket No. PTO-P-2021-0006] RIN 0651-AD53 Standard for Presentation of Nucleotide and Amino Acid Sequence Listings Using XML (eXtensible Markup Language) in Patent Applications To Implement WIPO Standard ST.26; Incorporation by Reference AGENCY: United States Patent and Trademark Office, Department of Commerce. ACTION: Notice of proposed rulemaking. ----------------------------------------------------------------------- SUMMARY: The United States Patent and Trademark Office (USPTO or Office) is proposing to revise the rules of practice for submitting biological sequence data associated with disclosures of nucleotide and amino acid sequences in patent applications by incorporating by reference the provisions of Standard ST.26 into the USPTO rules. Other conforming changes to accommodate for proposed new rules of practice based on the new standard are also included. These proposed amendments would apply to international and national applications filed on or after January 1, 2022. In addition to simplifying the process for applicants filing in multiple countries, a requirement to submit a single sequence listing in eXtensible Mark-up Language (XML) format will result in better preservation, accessibility, and sorting of the submitted sequence data for the public. DATES: Comments must be received by September 7, 2021 to ensure consideration. ADDRESSES: For reasons of Government efficiency, comments must be submitted through the Federal eRulemaking Portal at www.regulations.gov. To submit comments via www.regulations.gov, enter docket number PTO-P-2021-0006 on the homepage and click ``Search.'' The site will provide a search results page listing all documents associated with this docket. Find a reference to this notice and click on the ``Comment Now!'' icon, complete the required fields, and enter or attach your comments. Attachments to electronic comments will be accepted in ADOBE[supreg] portable document format or MICROSOFT WORD[supreg] format. Because comments will be made available for public inspection, information that the submitter does not desire to make public, such as an address or phone number, should not be included in the comments. Visit the Federal eRulemaking Portal website (www.regulations.gov) for additional instructions on providing comments via the portal. If electronic submission of comments is not feasible due to lack of access to a computer and/or the internet, please contact the USPTO using the contact information below for special instructions. FOR FURTHER INFORMATION CONTACT: Mary C. Till, Senior Legal Advisor, Office of Patent Legal Administration, Office of the Deputy Commissioner for Patents, by email at [email protected]; or Ali Salimi, Senior Legal Advisor, Office of Patent Legal Administration, Office of the Deputy Commissioner for Patents, by email at [email protected]. Contact via telephone at 571-272-7704 for special instructions on submission of comments. SUPPLEMENTARY INFORMATION: Table of Contents I. Background a. Summary of Changes b. Introduction c. Standard ST.26 d. Benefits e. WIPO Authoring and Validation Tool (WIPO Sequence) f. Applicability II. Discussion of Specific Rules III. Rulemaking Considerations I. Background a. Summary of Changes Standard ST.26 is the new international standard developed by the World Intellectual Property Organization (WIPO) and member states and adopted by the same. Under Standard ST.26, patent applications that contain disclosures of nucleotides and/or amino acid sequence(s) must present [[Page 35430]] the associated biological sequence data in a standardized electronic format (a ``Sequence Listing XML'') as a separate part of the specification. Under the proposed rules, in international applications filed under the Patent Cooperation Treaty (PCT) and in national and regional applications in Intellectual Property Offices (IPOs) of WIPO member states, an applicant will have to submit a single internationally acceptable sequence listing in a language neutral format using specified International Nucleotide Sequence Database Collaboration (INSDC) identifiers, such that a single sequence listing can be prepared for worldwide use. The proposed rule changes include: (1) Creation of new rules (Sec. Sec. 1.831 through 1.835) to incorporate by reference Standard ST.26; (2) use of INSDC sequence data elements to replace numeric identifiers from the previous standard; (3) modification of rules of practice to include reference to ``Sequence Listing XML;'' (4) elimination of a paper or PDF copy of the sequence listing; (5) elimination of the option to include within a sequence listing sequences with fewer than 4 amino acids and fewer than 10 nucleotides; and (6) clarification and simplification of the rules to aid in understanding of the requirements that they set forth. b. Introduction The sequence rules (37 CFR 1.821 through 1.825) provide a standardized format for description of nucleotide and amino acid sequence data in patent applications and require the submission of such sequences in computer readable form (CRF). The current USPTO rules are based on WIPO Standard ST.25, which became effective in 1998, and use a flat file structure of numeric identifiers using a limited set of character codes. A new international standard, ST.26, was agreed upon by WIPO member states, and would apply to international and national applications filed on or after January 1, 2022. Applications pending prior to January 1, 2022, would not have to comply with Standard ST.26. In an effort to streamline and reduce the procedural requirements found in the existing rules, and to respond to the needs of our customers to conform to Standard ST.26, the USPTO is proposing to amend its rules of practice for submitting biological sequence data associated with disclosures of nucleotide and amino acid sequences in patent applications filed on or after January 1, 2022, to comply with Standard ST.26. To decrease the burden on applicants who file applications containing nucleotide and amino acid sequence information internationally, the USPTO has worked with other WIPO member states as part of the Committee on WIPO Standards (CWS) to develop a single internationally acceptable sequence listing standard for use in patent applications filed in those states. Beginning in October of 2010, the CWS established a Task Force to propose a revised standard for the filing of nucleotide and/or amino acid sequence listings in XML file format (hereinafter referred to as a ``Sequence Listing XML''). In order to obtain public input on the content of Standard ST.26, the USPTO issued Requests for Comments in 2012 and 2016 (``Request for Comments on the Recommendation for the Disclosure of Sequence Listings Using XML (Proposed ST.26).'' (See 77 FR 28541 (May 15, 2012)) and ``Standard ST.26-Request for Comments on the Recommended Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings using XML (eXtensible Markup Language).'' (See 81 FR 74775 (October 27, 2016))). The adopted version of Standard ST.26 takes those comments into account. To achieve the goals that WIPO and WIPO member states (including the United States) set out by developing the sequence listing standard for presenting data consistently across all IPOs, all WIPO member states agreed to implement ST.26 for international and national applications filed on or after January 1, 2022. Therefore, upon finalizing the proposed rules, applications filed electronically in the United States on or after January 1, 2022, would need to conform to Standard ST.26, which requires submitting sequence listings in XML format. The USPTO is further proposing that applications that claim benefit or priority to an earlier application, where the earlier application contained a sequence listing that complied with the Standard ST.25 sequence rules, comply with the new rules that incorporate by reference Standard ST.26. In order to facilitate compliance, WIPO Sequence, a sequence listing authoring and validating tool, has been developed by WIPO with input from WIPO member states so that applicants can use it to prepare and validate their sequence listings in XML format as discussed infra. The USPTO is proposing to add to the patent rules (37 CFR part 1) by incorporating by reference Standard ST.26, and providing conforming amendments to the current rules. To ensure that biological sequence data associated with the disclosures of nucleotides and/or amino acid sequence(s) in patent applications can be widely disseminated and searchable by the public and IPOs, the USPTO works with the National Center for Biotechnology Information (NCBI) for inclusion of patent sequence data in the GenBank searchable database. For NCBI to include all sequence data from the USPTO, the data must be provided in INSDC format so that it is compatible with GenBank. The Standard ST.25 format sequence listings cannot be readily converted to INSDC format, resulting in only a fraction of patent sequence information appearing in GenBank. This data loss limits the sequence information provided to the public and exchanged with other sequence database providers, e.g., the National Institute of Genetics (NIG) in Japan, the DNA Data Bank of Japan (DDBJ) and European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI). WIPO has been working with the WIPO member states to create, adopt, and implement Standard ST.26 for sequence listing submissions in XML file format having the INSDC data elements to address the data loss. Standard ST.26 aims to enhance the accuracy and quality of biological sequence data that is publicly disseminated. In adopting and implementing Standard ST.26, more complete biological sequence data from patents and patent applications will be included in GenBank and thus be accessible by the public. The change from ASCII format to XML format will result in sequence data having computer tags that facilitate sorting and retrieving, and permit ease of access to the data. Additionally, NCBI is planning to stop accepting data in Standard ST.25 format for inclusion in GenBank in about 3-5 years after January 1, 2022 (the Standard ST.26 transition date). c. Standard ST.26 The WIPO ``Handbook on Industrial Property Information and Documentation'' sets forth standards for the presentation of data in many contexts. Standard ST.26 is titled ``Recommended Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings Using XML (eXtensible Markup Language).'' Adoption of the current version, version 1.4, by the CWS, occurred in December of 2020 and reaffirms that January 1, 2022, is expected to be the implementation date for all WIPO member states. The proposed USPTO rules incorporate by reference Standard ST.26. [[Page 35431]] The adopted version of Standard ST.26 is composed of eight documents, namely, the main body of the Standard, a first annex setting forth the controlled vocabulary for use with the main body, a second annex setting forth the Document Type Definition (DTD) for the Standard, a third annex containing a sequence listing specimen, a fourth annex setting forth the character subset from the Unicode Basic Latin Code Table, a fifth annex setting forth additional data exchange requirements for IPOs, a sixth annex containing a guidance document, and a seventh annex setting forth recommendations for the transformation of a sequence listing from Standard ST.25 format to Standard ST.26 format including avoiding adding or deleting subject matter. These materials can be found at http://www.wipo.int/export/sites/www/standards/en/pdf/03-26-01.pdf. The main body of Standard ST.26 defines the disclosures of nucleotide and amino acid sequences in patent applications that must be presented in a sequence listing in XML format in the manner specified in the Standard. Specifically, as detailed in paragraph eight of the main body, a sequence listing must not include, as a sequence assigned its own sequence identification number, any sequences having fewer than ten specifically defined nucleotides, or fewer than four specifically defined amino acids. The main body establishes the requirements for representation of nucleotide and amino acid sequences and the requirements for the XML file format for a sequence listing. The first annex contains controlled vocabulary that provides nucleotide base codes, lists of modified nucleotides and their abbreviations, amino acid codes, and a list of modified amino acids and their abbreviations. In addition, the first annex provides defined feature keys and qualifiers used for nucleotide and amino acid sequences in the XML file for a sequence listing. This first annex specifically identifies qualifiers with language-dependent ``free text'' values that may require translation for national and regional procedures. The second annex provides the DTD setting forth the technical specifications to which a submitted Sequence Listing XML must conform. The third annex provides a specimen of a Standard ST.26 compliant sequence listing that shows a representation of an entire sequence listing in XML format. Annex IV provides a table of the character subset from the Unicode Basic Latin Code that will be used for a ``Sequence Listing XML.'' Annex V provides guidance to WIPO member states on how certain sequence elements should be populated when data is exchanged with database providers. Annex VI, containing the guidance document, is provided to ensure that all applicants and WIPO member states understand the requirements for inclusion and representation of sequence disclosures. This guidance document was developed, in part, to address concerns raised in response to the USPTO's requests for comment in 2012 and 2016, mentioned above. The guidance document illustrates the requirements of selected paragraphs found in the main body of Standard ST.26 through specific examples of nucleotide and amino acid biological sequence data. Additionally, the document provides guidance on the manner in which biological sequence data is represented within a Standard ST.26 compliant sequence listing in XML format. Annex VII addresses the potential consequence of these requirements when transforming a compliant Standard ST.25 sequence listing to a Standard ST.26 sequence listing, and provides detailed guidance on avoiding added or deleted subject matter due to the additional requirements of Standard ST.26. d. Benefits Transitioning from rules based on Standard ST.25 (i.e., the current basis for the USPTO rules for ``Sequence Listings'') to rules based on Standard ST.26 will be beneficial to both patent applicants filing sequence listings and IPOs receiving applications containing disclosures of nucleotide and amino acid sequences. Standard ST.26 provides clear requirements as to what must be included in a sequence listing, and how sequences must be represented. For example, it standardizes the representation of modified nucleotide sequences and amino acid sequences as well as variants derived from primary sequences. Since Standard ST.26 contains a guidance document that illustrates the requirements for inclusion and representation of biological sequence data, patent applicants will have a clearer understanding of the requirements for presentation of biological sequence data in a compliant sequence listing under Standard ST.26. Additionally, since Standard ST.26 only allows XML format, the potential for differences under the current rules between a sequence listing filed in paper/PDF format and the required electronic CRF will be eliminated. As a further benefit, IPOs of WIPO member states will no longer need to expend resources to process paper sequence listings and perform necessary checks on the contents of paper documents. Unlike rules based on Standard ST.25, rules based on Standard ST.26 will allow patent applicants to file a single sequence listing with the USPTO (with the exception of changes to comply with national language requirements) that will be acceptable to the IPOs of WIPO member states. Under Standard ST.25, IPOs have interpreted and enforced rules differently due to the imprecise language in the previous Standard. This has resulted in the frustrating situation where applicants generate sequence listings that may be accepted in one IPO but not another. Standard ST.26 was drafted to precisely define what must and must not be included in a sequence listing, and how sequences must be represented in a sequence listing. The ``Guidance document with illustrated examples'' in Annex VI of Standard ST.26 illustrates the application of the rules to real-world sequence disclosure examples, eliminating the possibility of misinterpretation by IPOs or applicants. Due to the improved data structure of XML, transitioning from the current USPTO rules based on Standard ST.25 to rules based on Standard ST.26 will have the effect of increasing the quality of examination of patent applications containing biological sequence data since a more comprehensive search will be possible. Sequence listings submitted in accordance with Standard ST.26 allow for targeted searching of both sequence annotation and newly required sequence types, such as D-amino acids, nucleotide analogues, and linear portions of branched sequences. Finally, sequence listing submissions under rules based on Standard ST.26 will enhance public database content, as they include the sequence annotations (e.g., feature keys and qualifiers) used by database providers to describe biological sequence data. Standard ST.26 standardizes sequence variant presentation, annotation of modified and unusual residues, feature location descriptors, use of feature keys and qualifiers, organism names, and presentation of coding regions. Incorporation by reference of Standard ST.26 into USPTO rules has the effect of promoting data exchange between USPTO and NCBI due to use of INSDC identifiers required by database providers. The presence of additional data, as well as the enhanced compatibility to facilitate the exchange of data, will increase the value of database searches for biotechnology stakeholders that relate to nucleotide and amino acid sequences. [[Page 35432]] The USPTO recommends requiring compliance with Standard ST.26 for an application filed on or after January 1, 2022, because it will reduce the complexity and cost of long-term maintenance of IT systems for accepting sequence listings in multiple formats, provide a clear implementation date, and will facilitate transition to the format requirements of database providers. In addition, a requirement to submit a single sequence listing in XML format will result in better preservation, accessibility, and sorting of the submitted sequence data for the public. As noted herein, WIPO has created a tool to assist applicants with translation of existing sequences to the new standard. e. WIPO Authoring and Validation Tool (WIPO Sequence) To comply with rules that are based on Standard ST.26, patent applicants will be able to use ``WIPO Sequence,'' a freely-available desktop application developed by WIPO and adopted by WIPO member states, to generate a Standard ST.26 compliant sequence listing. WIPO Sequence has two functions: An authoring function and a validation function. Patent applicants will be able to author and validate their sequence listing using WIPO Sequence to comply with the requirements of Standard ST.26. Such a sequence listing will be accepted by all IPOs of WIPO member states. Thus, the burden of generating a sequence listing which is acceptable across all WIPO member states will be significantly decreased for patent applicants under Standard ST.26. This tool will be downloadable, free of charge, from the WIPO website. Currently, a beta version of WIPO Sequence is accessible at https://www.wipo.int/standards/en/sequence/index.html. This beta version will allow the public to familiarize themselves with the tool and its dual functionalities. WIPO Sequence will allow a user to create and save patent application data and biological sequence data in a project, validate the project to ensure all required information is present, and generate a sequence listing in Standard ST.26 XML format. Information can be entered into a project manually, or data can be imported from a source file in one of a number of file types. WIPO Sequence can import data from other Standard ST.26 projects, Standard ST.26 XML sequence listings, Standard ST.25 sequence listing text files, raw files, multi- sequence format files, and FASTA (FAST-All-a DNA and protein sequence alignment software package) files. Feature keys, qualifiers, and organism names are available to select from drop-down lists, simplifying the creation of sequence listings. Applicant and inventor names, as well as custom organism names, can be stored within WIPO Sequence for ease of access. To facilitate review of data entered into a project, WIPO Sequence can generate a ``human-readable'' version of the sequence listing in addition to the XML sequence listing. WIPO Sequence includes an integrated validation function that will alert users to most errors in a project or sequence listing data. The validation function generates a report that clearly lists every detected error, the location of the error, and the detected value of the error, along with a link to the sequence in question, thereby ensuring users can correct errors before generating a final sequence listing. While the validation function will alert a user to most errors that are contained in a project or sequence listing, there are a small number of errors that can be detected only by human review (for example, an inappropriate organism name). In those cases, the integrated validation function will list a ``warning'' in the validation report, reminding users of the applicable/relevant rule and urging them to check their input values before generating a final sequence listing. A sequence listing in Standard ST.25 format cannot automatically be converted into Standard ST.26 format because certain data elements required for a Standard ST.26 compliant sequence listing are not present in Standard ST.25. Therefore, conversion of a sequence listing in Standard ST.25 format to Standard ST.26 format necessarily requires additional input from the applicant. WIPO Sequence supplemented by significant guidance from WIPO and USPTO (in Annex VI and Annex VII of Standard ST.26) will help applicants accomplish this task. Users can import a Standard ST.25 sequence listing into a project, and WIPO Sequence automatically performs many of the necessary conversions. An Import Report is generated that alerts the user to all data conversions, and lists all sequence entries that require additional input. In response to concerns raised in comment to the USPTO's requests for comments in 2012 and 2016, the USPTO, in conjunction with WIPO, developed Annex VII to provide detailed guidance to help applicants avoid added or deleted subject matter when converting a sequence listing from Standard ST.25 format into Standard ST.26 format. In order to ensure that IPOs can validate and accept sequence listing projects from applicants generated with WIPO Sequence, WIPO is developing a Standard ST.26 sequence listing validation tool, WIPO Sequence Validator. WIPO Sequence Validator will be for use by IPOs. WIPO Sequence Validator will be synchronized with the validation function in the WIPO Sequence tool. The USPTO is integrating WIPO Sequence Validator into its internal IT systems. The WIPO Sequence Validator will apply the same validation rules as WIPO Sequence. Therefore, filers will have a greater level of confidence that a sequence listing authored and validated by WIPO Sequence will comply with the USPTO rules for ``Sequence Listing XMLs'' (Sec. Sec. 1.831 through 1.835) and accepted since the WIPO Sequence Validator that USPTO will use is based on Standard ST.26, which is incorporated by reference into the USPTO proposed rules of practice. e. Applicability In accordance with these proposed rules of practice, an application that has a filing date on or after January 1, 2022, would be required to contain a sequence listing in accordance with proposed Sec. Sec. 1.831 through 1.835, which incorporate by reference Standard ST.26. This includes applications that claim priority to applications with filing dates before January 1, 2022. Such applications include but are not limited to applications having one or more benefit or priority claims under 35 U.S.C. 119(e) (claiming the benefit of a provisional), section 120 (claiming the benefit as a continuation and/or continuation-in-part), section 121 (claiming the benefit as a divisional), section 365 (claiming the benefit as a continuation or continuation in part to a PCT application), or section 119(a)-(d) (claiming the benefit to a foreign filed application or a prior filed PCT). If a prior application to which benefit or priority is claimed contains a sequence listing in Standard ST.25 format, the applicant would be required to convert that sequence listing to Standard ST.26 format for inclusion in the new application filed on or after January 1, 2022. As provided in 35 U.S.C. 363, the filing date of an international stage application is also the filing date for the national stage application filed under 35 U.S.C. 371. Accordingly, for applications filed under 35 U.S.C. 371, compliance with Standard ST.26 is based on the international filing date of the corresponding international application, rather than the date of submission of the national stage application in the USPTO. The proposed rules would also be applicable to applications for reissue without [[Page 35433]] regard to the filing date of the originally granted patent for which reissue is sought. That is, any reissue application filed on or after January 1, 2022, where the disclosure or claims contain nucleotide and amino acid sequences would be required to comply with proposed Sec. Sec. 1.831 through 1.835. Relying on the actual filing date of an application to determine whether a sequence listing must conform to Sec. Sec. 1.821 through 1.825 (rules based on Standard ST.25) or Sec. Sec. 1.831 through 1.835 (rules based on Standard ST.26) will simplify the application of the sequence rules, both for the USPTO and the applicant. II. Discussion of Specific Rules Section 1.52: Paragraph (e)(1)(ii) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to include reference to a ``Sequence Listing XML'' submitted under Sec. 1.831(a) in compliance with Sec. Sec. 1.832 through 1.834. Section 1.52(e)(3)(iv) is proposed to be added to require that the contents of each read-only optical disc for a ``Sequence Listing XML'' must be in XML file format and, if compressed, must be compressed in accordance with Sec. 1.834. Section 1.52(e)(7) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to require that any amendment to the information on a read-only optical disc submitted in relation to a ``Sequence Listing XML'' be in accordance with Sec. 1.835(b). Section 1.52(f)(1) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to indicate that any XML file submitted on a read-only optical disc is excluded from the application size fee determination if the read-only optical disc contains a ``Sequence Listing XML'' in compliance with Sec. 1.831(a). The provision at 35 U.S.C 41(a)(1)(G) provides the basis for excluding ``any sequence listing,'' when filed in electronic medium, from the application size fee determination. A ``Sequence Listing XML'' is considered ``any sequence listing.'' Section 1.52(f)(1)(i) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to reference any ``Sequence Listing XML'' in compliance with Sec. 1.831(a). Section 1.52(f)(2) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to indicate that any XML file, submitted via the USPTO patent electronic filing system for a ``Sequence Listing XML'' in compliance with Sec. 1.831(a) is excluded from the application size fee determination. The provision at 35 U.S.C 41(a)(1)(G) provides the basis for excluding ``any sequence listing'' when filed in electronic medium from the application size fee determination. A ``Sequence Listing XML'' is considered ``any sequence listing.'' Section 1.52(f)(2)(i) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to reference any ``Sequence Listing XML'' in compliance with Sec. 1.831(a). Section 1.52(f)(3) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to subject any ``Sequence Listing XML'' of 300MB-800MB to the surcharge set forth in Sec. 1.21(o)(1) and any ``Sequence Listing XML'' over 800MB to the surcharge set forth in Sec. 1.21(o)(2). Section 1.53: Section 1.53(c)(4) is proposed to be revised to indicate that a separate sequence listing in a provisional application disclosing nucleotide and/or amino acid sequences is not required but, any biological sequence data submitted in a provisional application filed on or after January 1, 2022, must be a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.834. This proposed change is not anticipated to cover applications filed before January 1, 2022. Section 1.77: Section 1.77(b)(5) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph by reorganizing under Sec. 1.77(b)(5)(i) the provisions for an incorporation by reference statement for ASCII plain text tiles submitted for a ``Computer Program Listing Appendix'' (Sec. 1.77(b)(5)(i)(A)), a ``Sequence Listing'' (Sec. 1.77(b)(5)(i)(B)), and ``Large Tables'' (Sec. 1.77(b)(5)(i)(C)). Section 1.77(b)(5)(ii) would contain provisions for an incorporation by reference statement for a ``Sequence Listing XML'' submitted via a USPTO patent electronic filing system or on one or more read-only optical discs. There would be no Sec. 1.77(b)(5)(iii). Section 1.121: Section 1.121(b) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to add an exception to amendment practice for ``Sequence Listing XML''s (Sec. 1.835). Section 1.121(b)(6) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to require that changes to a ``Sequence Listing XML'' be made in accordance with Sec. 1.835. Section 1.173: The heading of Sec. 1.173(b)(1) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that heading to include ` ``Sequence Listing XML' (Sec. 1.831(a)).'' Section 1.173(b)(1)(i) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to add an exception to reissue amendment practice for a ` ``Sequence Listing XML' (Sec. 1.831(a)).'' Section 1.173(b)(1)(ii) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to provide that changes to a ``Sequence Listing XML'' must be made in accordance Sec. 1.835. Section 1.173(d) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to also exclude a ``Sequence Listing XML'' from the manner of making amendments in a reissue application. Section 1.211: Section 1.211(c) is proposed to be amended to add a ``Sequence Listing'' in compliance with Sec. Sec. 1.821 through 1.825 (if applicable) for an application filed before January 1, 2022, and a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable) for an application filed on or after January 1, 2022, to the currently listed items that may delay application publication if not present. Section 1.495: Section 1.495(c)(5) is proposed to be amended to delineate between translations needed for a ``Sequence Listing'' in international applications entering the national stage in the United States having an international filing date before January 1, 2022, and a ``Sequence Listing'' in XML format for international applications entering the national stage in the United States having an international filing date on or after January 1, 2022. Specifically, the [[Page 35434]] proposed amendment indicates that a ``Sequence Listing'' need not be translated for national stage entry if the ``Sequence Listing'' complies with PCT Rule 12.1(d) and the description complies with PCT Rule 5.2(b) for applications having an international filing date before January 1, 2022. However, the proposed amendment indicates that a ``Sequence Listing'' in XML format must be translated for national stage entry if a ``Sequence Listing'' in XML format was submitted in an international application with non-English language values for the invention title and/or any language-dependent free text qualifiers and has an international filing date on or after January 1, 2022. Section 1.530: The heading of Sec. 1.530(d)(1) was proposed to be amended in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that heading to include ` ``Sequence Listing XML' (Sec. 1.831(a)).'' Section 1.530(d)(1)(i) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to add an exception to reexamination amendment practice for a ` ``Sequence Listing XML' (Sec. 1.831(a)).'' Section 1.530(d)(1)(ii) was proposed to be added in another rulemaking that published at 86 FR 28301 (May 26, 2021). This proposed rule would further amend that paragraph to provide that changes to a ``Sequence Listing XML'' must be made in accordance with Sec. 1.835. Section 1.704: Section 1.704(f) is proposed to be amended to add a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable) to the list of items that are required for an application filed under 35 U.S.C. 111(a) to be in condition for examination for purposes of calculating a reduction in patent term adjustment. The amendment also proposes to add a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable) to the list of items that must be submitted in an international application for such an application to be in condition for examination when the application has entered the national stage as defined in Sec. 1.491(b). Lastly, the rule is also proposed to be amended to add a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable) to the current list of items for which an application is considered to be in compliance, for purposes of determining a patent term adjustment reduction, on the filing date of the latest reply (if any) correcting the papers, drawings, or sequence listing that is prior to the date of mailing of either an action under 35 U.S.C. 132 or a notice of allowance under 35 U.S.C. 151, whichever occurs first. Section 1.831: Section 1.831 is proposed to be added to require that patent applications having disclosures of nucleotide and amino acid sequences, as those terms are defined in the rule, must contain, as a separate part of the disclosure, a ``Sequence Listing XML'' for patent applications having a filing date on or after January 1, 2022. Section 1.831(a) is proposed to be added to specify that the ``Sequence Listing XML'' uses the symbols and format in accordance with the requirements of Sec. Sec. 1.832 through 1.834. Section 1.831(b)(1) and (2) are proposed to be added to define the nucleotide and amino acid sequences that are encompassed by the rule for which a ``Sequence Listing XML'' is needed. Specifically, nucleotide and/or amino acid sequences as used in these proposed rules encompass: An unbranched sequence or linear region of a branched sequence containing four or more specifically defined amino acids, wherein the amino acids form a single peptide backbone or an unbranched sequence or linear region of a branched sequence of 10 or more specifically defined nucleotides, wherein adjacent nucleotides are joined by: A 3' to 5' (or 5' to 3') phosphodiester linkage or, for nucleotide analogs, any chemical bond that results in an arrangement of adjacent nucleobases that mimics the arrangement of nucleobases in naturally occurring nucleic acids. Section 1.831(c) is proposed to be added to state that, where the description or claims of a patent application discuss a sequence that is set forth in the ``Sequence Listing XML'' in accordance with paragraph (a) of this section, reference must be made to the sequence by use of the sequence identifier, preceded by SEQ ID NO: Or the like in the text of the description or claims, even if the sequence is also embedded in the text of the description or claims of the patent application. The use of SEQ ID NO: Is preferred but including ``or the like'' is intended to ensure that a formalities notice is not sent when an application uses, for example, ``SEQ NO.'' or ``Seq. Id. No.'' or any similar identification of an amino acid or nucleotide sequence in the specification or claims where it is clear that a sequence from the ``Sequence Listing XML'' is shown in the specification or claims. In identifying the sequence in the description or claims, the numeric sequence identifier from the ``Sequence Listing XML'' must be identifying the same sequence. Section 1.831(d) is proposed to be added to define the expression ``enumeration of its residues,'' consistent with the definition in Paragraph 3(c) of WIPO Standard ST.26 itself (which is incorporated by reference herein). Section 1.831(e) is proposed to be added to define the expression ``specifically defined,'' consistent with the definition in Paragraph 3(m) of WIPO Standard ST.26 (2020). Section 1.831(f) is proposed to be added to define the expression ``amino acid,'' consistent with the definition in Paragraph 3(a) of WIPO Standard ST.26 (2020). Section 1.831(g) is proposed to be added to define the expression ``modified amino acid,'' consistent with the definition in Paragraph 3(g) of WIPO Standard ST.26 (2020). Section 1.831(h) is proposed to be added to define the expression ``nucleotide,'' consistent with Paragraphs 3(h) and 3(i) of WIPO Standard ST.26 (2020). Section 1.831(i) is proposed to be added to define the expression ``modified nucleotide,'' consistent with Paragraph 3(h) of WIPO Standard ST.26 (2020). Section 1.832: Section 1.832 is proposed to be added to provide the manner in which a nucleotide and/or amino acid sequence is presented in the ``Sequence Listing XML'' part of a patent application having a filing date on or after January 1, 2022. Section 1.832(a) is proposed to be added to define the requirements for representation of sequences in a ``Sequence Listing XML'' part of the application. Specifically, each nucleotide and/or amino acid sequence presented in the ``Sequence Listing XML'' must be assigned a separate sequence identifier, and the sequence identifiers must begin with the number 1, and increase sequentially by integers as defined in Paragraph 10 of WIPO Standard ST.26 (2020). Section 1.832(b)(1) through (4) are proposed to be added to define the requirements for representation of nucleotide sequence data in the ``Sequence Listing XML.'' Specifically, a nucleotide sequence must be represented in the manner described in Paragraphs 11-12 of WIPO Standard ST.26 (2020). All nucleotides, including nucleotide analogs, modified nucleotides, ``unknown'' nucleotides in a nucleotide sequence must be represented and described using symbols in the manner described in Paragraphs 13-19 and 21 of WIPO [[Page 35435]] Standard ST.26 (2020). For a region containing a known number of contiguous ``a'', ``c'', ``g'', ``t'', or ``n'' residues for which the same description applies, the entire region may be jointly described as provided in Paragraph 22 of WIPO Standard ST.26 (2020). Section 1.832(c)(1) through (4) are proposed to be added to define the requirements for representation of amino acid sequence data in the ``Sequence Listing XML.'' Specifically, an amino acid sequence must be represented in the manner described in Paragraphs 24-25 of WIPO Standard ST.26 (2020). All amino acids, including modified amino acids and ``unknown'' amino acids, in an amino acid sequence must be represented and described using symbols in the manner described in Paragraphs 24-30 and 32 of WIPO Standard ST.26 (2020). For a region containing a known number of contiguous ``X'' residues for which the same description applies, the entire region may be jointly described as provided in Paragraph 34 of WIPO Standard ST.26 (2020). Section 1.832(d) is proposed to be added to define the manner in which a single continuous sequence, derived from one or more non- contiguous segments of a larger sequence, or from segments of different sequences, must be represented, as described in Paragraph 35 of WIPO Standard ST.26 (2020). Section 1.832(e) is proposed to be added to define the manner in which a nucleotide and/or amino acid sequence that contains regions of specifically defined residues separated by one or more regions of contiguous ``n'' or ``X'' residues of specified length must be represented, as described in Paragraph 36 of WIPO Standard ST.26 (2020). Section 1.832(f) is proposed to be added to define the manner in which nucleotide and/or amino acid sequence that contains regions of specifically defined residues separated by one or more gaps of an unknown or undisclosed number of residues must be represented, as described in Paragraph 37 of WIPO Standard ST.26 (2020). Section 1.833: Section 1.833 is proposed to be added to describe the requirements for a ``Sequence Listing XML,'' which is required by Sec. 1.831(a) for patent applications with a filing date on or after January 1, 2022, in order to comply with WIPO Standard ST.26 (2020). Section 1.833(a) is proposed to be added to require that the ``Sequence Listing XML'' must be presented as a single XML 1.0 file and encoded using Unicode UTF-8. Section 1.833(b)(1) is proposed to be added to require that the ``Sequence Listing XML'' must be valid according to the DTD as presented in Annex II of WIPO Standard ST.26 (2020). Section 1.833(b)(2) is proposed to be added to require that a ``Sequence Listing XML'' must comply with the list of items enumerated in (i)-(v) which are found in WIPO Standard ST.26 (2020). Section 1.833(b)(2)(i) is proposed to be added to require that the ``Sequence Listing XML'' contain an XML declaration as defined in WIPO Standard ST.26 (2020), Paragraph 39. Section 1.833(b)(2)(ii) is proposed to be added to require that the ``Sequence Listing XML'' contain a document type declaration as defined in WIPO Standard ST.26 (2020), Paragraph 39. Section 1.833(b)(2)(iii) is proposed to be added to require that the ``Sequence Listing XML'' contain a root element as defined in WIPO Standard ST.26 (2020), Paragraph 43. Section 1.833(b)(2)(iv) is proposed to be added to require that the ``Sequence Listing XML'' contain a general information part that complies with WIPO Standard ST.26 (2020), Paragraphs 45, 47 and 48, as applicable. Section 1.833(b)(2)(v) is proposed to be added to require that the ``Sequence Listing XML'' contain a sequence data part that complies with WIPO Standard ST.26 (2020), Paragraphs 50-55, 57-58, 60-69, 71-78, 80-87, 89-98 and 100, as applicable. Section 1.833(b)(3) is proposed to be added to require that the ``Sequence Listing XML'' contains at least one InventionTitle element, as set forth in WIPO Standard ST.26 at Paragraphs 45 and 48, in the English language since English is required under Sec. 1.52(b)(1)(ii). Section 1.833(b)(4) is proposed to be added to require that an INSDQualifier_value element includes a value for that element in the English language for each language-dependent free text qualifier in the ``Sequence Listing XML,'' as required by Sec. 1.52(b)(1)(ii), and where an INSDQualifier_value element is defined in WIPO Standard ST.26 (2020), Paragraphs 76 and 85-88. Section 1.834: Section 1.834 is proposed to be added to provide details on the form and format for nucleotide and/or amino acid sequence submissions as the ``Sequence Listing XML'' in patent applications filed on or after January 1, 2022. Section 1.834(a) is proposed to be added to indicate that a ``Sequence Listing XML'' in Unicode UTF-8 created by any means (e.g., text editors, nucleotide/amino acid sequence editors, or other custom computer programs) in accordance with Sec. Sec. 1.831 through 1.833 must: (1) Have the following compatibilities: (i) Computer compatibility: PC or Mac[supreg]; and (ii) operating system compatibility (e.g., MS-DOS[supreg], MS-Windows[supreg], Mac OS[supreg], or Unix[supreg]/Linux[supreg]); (2) be in XML format, where all permitted printable characters (including the space character) and non-printable (control) characters are defined in Paragraph 40 of WIPO Standard ST.26 (2020); and (3) be named as *.xml, where ``*'' is one character or a combination of characters limited to upper- or lowercase letters, numbers, hyphens, and underscores and the name does not exceed 60 characters in total, excluding the extension. No spaces or other types of characters are permitted in the file name. Section 1.834(b) is proposed to be added to require that the ``Sequence Listing XML'' must be in a single file containing the sequence information and be submitted either: (1) Electronically via the USPTO patent electronic filing system, where the file size must not exceed 100 MB and file compression is not permitted; or (2) on read- only optical disc(s) in compliance with Sec. 1.52(e), where (i) a file that is not compressed must be contained on a single read-only optical disc, (ii) the file may be compressed using WinZip[supreg], 7-Zip, or Unix[supreg]/Linux[supreg] Zip, (iii) a compressed file must not be self-extracting, and (iv) a compressed XML file that does not fit on a single read-only optical disc may be split into multiple file parts in accordance with the target read-only optical disc size and labeled in compliance with Sec. 1.52(e)(5)(vi). Section 1.835: Section 1.835 is proposed to be added to provide the requirements for submission of an amendment to add or replace a ``Sequence Listing XML'' for applications filed on or after January 1, 2022. Section 1.835(a) is proposed to be added to require that any amendment to a patent application adding an initial submission of a ``Sequence Listing XML'' as required by Sec. 1.831(a) after the application filing date must include: (1) A ``Sequence Listing XML'' file submitted either (i) via the USPTO patent electronic filing system or (ii) on a read-only optical disc in compliance with Sec. 1.52(e); (2) an instruction to amend the specification to include an incorporation by reference statement of the material in the ``Sequence Listing XML'' file, identifying the name of the file, the date of creation, and the size of the file in bytes (see Sec. 1.77(b)(5)(ii)), except when submitted to the United States International Preliminary Examining Authority for an [[Page 35436]] international application; (3) a statement that indicates the basis for the amendment, with specific references to particular parts of the application as originally filed (specification, claims, drawings) for all sequence data in the ``Sequence Listing XML''; and (4) a statement that the ``Sequence Listing XML'' includes no new matter. Section 1.835(b) is proposed to be added to require that any amendment adding to, deleting from or replacing sequence information in a ``Sequence Listing XML'' submitted as required by Sec. 1.831(a) must include: (1) A replacement ``Sequence Listing XML'' containing the entire ``Sequence Listing XML,'' including any additions, deletions, or replacements of sequence information, and shall be submitted either (i) via the USPTO patent electronic filing system, or (ii) on a read-only optical disc, in compliance with Sec. 1.52(e) labeled as ``REPLACEMENT MM/DD/YYYY'' (with the month, day, and year of creation indicated); (2) an instruction to amend the specification to include an incorporation by reference statement of the material in the replacement ``Sequence Listing XML'' file that identifies the name of the file, the date of creation, and the size of the file in bytes (see Sec. 1.77(b)(5)(ii)), except when the replacement ``Sequence Listing XML'' is submitted to the United States International Preliminary Examining Authority for an international application; (3) a statement that identifies the location of all additions, deletions or replacements of sequence information relative to the replaced ``Sequence Listing XML''; (4) a statement that indicates the support for the additions, deletions or replacements of the sequence information, with specific references to particular parts of the application as originally filed (specification, claims, drawings) for all amended sequence data in the replacement ``Sequence Listing XML''; and (5) a statement that the replacement ``Sequence Listing XML'' includes no new matter. Section 1.835(c) is proposed to be added to require that the specification of a complete application with a ``Sequence Listing XML'' as required under Sec. 1.831(a) present on the application filing date but without an incorporation by reference of the material contained in the ``Sequence Listing XML'' file must be amended to contain a separate paragraph incorporating by reference the material contained in the ``Sequence Listing XML'' file, in accordance with Sec. 1.77(b)(5)(ii), except for international applications. Section 1.835(d)(1) is proposed to be added to provide that when any of the requirements of Sec. Sec. 1.831 through 1.834 is not satisfied in an application under 35 U.S.C. 111(a) or in a national stage application under 35 U.S.C. 371, the applicant will be notified and given a period of time within which to comply with such requirements in order to prevent abandonment of the application. The proposed rule indicates that subject to Sec. 1.835(d)(2), any amendment to add or replace a ``Sequence Listing XML'' in reply to a requirement under this paragraph must be submitted in accordance with the requirements of Sec. 1.835(a) through (c). Section 1.835(d)(2) is proposed to be added to explicitly provide that compliance with Sec. 1.835(a) through (c) is not required for submission of a ``Sequence Listing XML'' that is solely an English translation of a previously submitted ``Sequence Listing XML'' that contains non-English values for the invention title (as per Sec. 1.833(b)(3)) and/or any language-dependent free text elements (as per Sec. 1.833(b)(4)). The required submission will be a translated ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.834. Updated values for attributes in the root element (Sec. 1.833(b)(2)(iii)) or elements of the general information part (Sec. 1.833(b)(2)(iv)) are not considered amendments for purposes of complying with Sec. 1.835(a) through (c). Even though Sec. Sec. 1.52(b)(1)(ii) and 1.495(c)(1)(i) require a translation for applications filed under 111(a) and for those entering the national stage, respectively, this proposed rule makes explicit that when a translated ``Sequence Listing XML'' is provided as a reply to a notice that the ``Sequence Listing XML'' contains non-English values for the invention title and/or any language-dependent free text elements, and the translation does not include deletions, additions or replacement of sequence information, the translated ``Sequence Listing XML'' need not comply with the requirements for an amended ``Sequence Listing XML'' as set forth in Sec. 1.835(a) through (c). Section 1.835(e) is proposed to be added to provide that when any of the requirements of Sec. Sec. 1.831 through 1.834 are not satisfied at the time of filing an international application under the PCT where the application is to be searched by the United States International Searching Authority or examined by the United States International Preliminary Examining Authority, the applicant may be sent a notice calling for compliance with the requirements within a prescribed time period. Under PCT Rule 13ter, applicant can provide, in reply to such a requirement or otherwise, a sequence listing which is a ``Sequence Listing XML'' in accordance with Sec. 1.831(a). The ``Sequence Listing XML'' must be accompanied by a statement that the information recorded does not go beyond the disclosure in the international application as filed. It must also be accompanied by the late furnishing fee set forth in Sec. 1.445(a)(5). If the applicant fails to timely provide the required ``Sequence Listing XML, '' the United States International Searching Authority shall search only to the extent that a meaningful search can be performed without the ``Sequence Listing XML,'' and the United States International Preliminary Examining Authority shall examine only to the extent that a meaningful examination can be performed without the ``Sequence Listing XML.'' Section 1.835(f) is proposed to be added to provide that any appropriate amendments to the ``Sequence Listing XML'' in a patent (e.g., by reason of reissue, reexamination, or certificate of correction) must comply with the requirements of paragraph (b) of this section. Section 1.839: Section 1.839 is proposed to be added to provide the location of WIPO Standard ST.26 (2020) that is being incorporated by reference. III. Rulemaking Considerations A. Administrative Procedure Act: The changes proposed in this rulemaking involve rules of agency practice and procedure, and/or interpretive rules. See Bachow Commc'ns Inc. v. FCC, 237 F.3d 683, 690 (D.C. Cir. 2001) (rules governing an application process are procedural under the Administrative Procedure Act); Inova Alexandria Hosp. v. Shalala, 244 F.3d 342, 350 (4th Cir. 2001) (rules for handling appeals are procedural where they do not change the substantive standard for reviewing claims); Nat'l Org. of Veterans' Advocates v. Sec'y of Veterans Affairs, 260 F.3d 1365, 1375 (Fed. Cir. 2001) (rule that clarifies interpretation of a statute is interpretive). Accordingly, prior notice and opportunity for public comment for the changes proposed in this rulemaking are not required pursuant to 5 U.S.C. 553(b) or (c), or any other law. See Cooper Techs. Co. v. Dudas, 536 F.3d 1330, 1336-37 (Fed. Cir. 2008) (stating that 5 U.S.C. 553, and thus 35 U.S.C. 2(b)(2)(B), do not require notice and comment rulemaking for ``interpretative rules, general statements of policy, or rules of agency organization, procedure, or practice'' (quoting 5 U.S.C. 553(b)(A))). However, the USPTO has chosen to seek public comment before [[Page 35437]] implementing the rule to benefit from the public's input. B. Regulatory Flexibility Act: Under the Regulatory Flexibility Act (5 U.S.C. 601 et seq.), whenever an agency is required by 5 U.S.C. 553 (or any other law) to publish a notice of proposed rulemaking (NPRM), the agency must prepare and make available for public comment an Initial Regulatory Flexibility Analysis, unless the agency certifies under 5 U.S.C. 605(b) that the proposed rule, if implemented, will not have a significant economic impact on a substantial number of small entities. 5 U.S.C. 603, 605. For the reasons set forth herein, the Senior Counsel for Regulatory and Legislative Affairs of the USPTO has certified to the Chief Counsel for Advocacy of the Small Business Administration that this rule will not have a significant economic impact on a substantial number of small entities. See 5 U.S.C. 605(b). The USPTO proposes to amend the rules of practice to require submission of biological sequence data in eXtensible Markup Language where the rules of practice incorporate by reference WIPO Standard ST.26, ``Recommended Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings Using XML (eXtensible Markup Language)'' as disclosed in the WIPO Handbook on Industrial Property Information and Documentation. This rulemaking would make more technical data associated with biotechnology inventions available to the public because the new rules of practice based on WIPO Standard ST.26 (2020) provide for enhanced biological sequence data related to disclosures of nucleotides and amino acids in patent applications. WIPO Standard ST.26 provides clear rules as to what must be included in a sequence listing and how sequences must be represented, for example, standardization of representation of modified nucleic acids and amino acids as well as variants derived from primary sequences. WIPO Standard ST.26 contains a guidance document that demonstrates the requirement for inclusion and representation of biological sequence data. As a result, patent applicants will have a clearer understanding as to the requirements and presentation of biological sequence data in a compliant sequence listing under WIPO Standard ST.26. Additionally, since WIPO Standard ST.26 only allows XML format, applicants will not be burdened or confused with the requirements of filing a sequence listing in paper or PDF format, and IPOs will not be burdened with processing paper sequence listings and performing necessary checks on the contents of the paper documents. This rulemaking's proposed changes are largely procedural in nature, and do not impose any additional requirements or fees on applicants. For the foregoing reasons, the changes proposed in this NPRM will not have a significant economic impact on a substantial number of small entities. C. Executive Order 12866 (Regulatory Planning and Review): This rulemaking has been determined to be not significant for purposes of Executive Order 12866 (Sept. 30, 1993). D. Executive Order 13563 (Improving Regulation and Regulatory Review): The USPTO has complied with Executive Order 13563 (Jan. 18, 2011). Specifically, to the extent feasible and applicable, the USPTO has (1) reasonably determined that the benefits of the rule justify its costs; (2) tailored the rule to impose the least burden on society consistent with obtaining the agency's regulatory objectives; (3) selected a regulatory approach that maximizes net benefits; (4) specified performance objectives; (5) identified and assessed available alternatives; (6) involved the public in an open exchange of information and perspectives among experts in relevant disciplines, affected stakeholders in the private sector, and the public as a whole, and provided online access to the rulemaking docket; (7) attempted to promote coordination, simplification, and harmonization across government agencies and identified goals designed to promote innovation; (8) considered approaches that reduce burdens while maintaining flexibility and freedom of choice for the public; and (9) ensured the objectivity of scientific and technological information and processes. E. Executive Order 13132 (Federalism): This rulemaking does not contain policies with federalism implications sufficient to warrant preparation of a Federalism Assessment under Executive Order 13132 (Aug. 4, 1999). F. Executive Order 13175 (Tribal Consultation): This rulemaking will not (1) have substantial direct effects on one or more Indian tribes; (2) impose substantial direct compliance costs on Indian tribal governments; or (3) preempt tribal law. Therefore, a tribal summary impact statement is not required under Executive Order 13175 (Nov. 6, 2000). G. Executive Order 13211 (Energy Effects): This rulemaking is not a significant energy action under Executive Order 13211 because this rulemaking is not likely to have a significant adverse effect on the supply, distribution, or use of energy. Therefore, a Statement of Energy Effects is not required under Executive Order 13211 (May 18, 2001). H. Executive Order 12988 (Civil Justice Reform): This rulemaking meets applicable standards to minimize litigation, eliminate ambiguity, and reduce burden as set forth in sections 3(a) and 3(b)(2) of Executive Order 12988 (Feb. 5, 1996). I. Executive Order 13045 (Protection of Children): This rulemaking does not concern an environmental risk to health or safety that may disproportionately affect children under Executive Order 13045 (Apr. 21, 1997). J. Executive Order 12630 (Taking of Private Property): This rulemaking will not effect a taking of private property or otherwise have taking implications under Executive Order 12630 (Mar. 15, 1988). K. Congressional Review Act: Under the Congressional Review Act provisions of the Small Business Regulatory Enforcement Fairness Act of 1996 (5 U.S.C. 801 et seq.), prior to issuing any final rule, the USPTO will submit a report containing the final rule and other required information to the United States Senate, the United States House of Representatives, and the Comptroller General of the Government Accountability Office. The changes in this rulemaking are not expected to result in an annual effect on the economy of $100 million or more, a major increase in costs or prices, or significant adverse effects on competition, employment, investment, productivity, innovation, or the ability of United States-based enterprises to compete with foreign- based enterprises in domestic and export markets. Therefore, this rulemaking is not expected to result in a ``major rule'' as defined in 5 U.S.C. 804(2). L. Unfunded Mandates Reform Act of 1995: The changes set forth in this rulemaking do not involve a Federal intergovernmental mandate that will result in the expenditure by State, local, and tribal governments, in the aggregate, of $100 million (as adjusted) or more in any one year, or a Federal private sector mandate that will result in the expenditure by the private sector of $100 million (as adjusted) or more in any one year, and will not significantly or uniquely affect small governments. Therefore, no actions are necessary under the provisions of the Unfunded Mandates Reform Act of 1995. See 2 U.S.C. 1501 et seq. M. National Environmental Policy Act of 1969: This rulemaking will not have [[Page 35438]] any effect on the quality of the environment and is thus categorically excluded from review under the National Environmental Policy Act of 1969. See 42 U.S.C. 4321 et seq. N. National Technology Transfer and Advancement Act of 1995: The requirements of section 12(d) of the National Technology Transfer and Advancement Act of 1995 (15 U.S.C. 272 note) are not applicable because this rulemaking does not contain provisions that involve the use of technical standards. O. Paperwork Reduction Act of 1995: The Paperwork Reduction Act of 1995 (44 U.S.C. 3501-3549) requires that the USPTO consider the impact of paperwork and other information collection burdens imposed on the public. In accordance with section 3507(d) of the Paperwork Reduction Act of 1995, the majority of the paperwork and other information collection burdens discussed in this proposed rule have already been approved under the following Office of Management and Budget (OMB) Control Numbers: 0651-0024 (Sequence Listing), 0651-0031 (Patent Processing), 0651-0032 (Initial Patent Applications), and 0651-0064 (Patent Reexaminations and Supplemental Examinations). Modifications to 0651-0024 because of this proposed rulemaking will be submitted to OMB for approval prior to this rule becoming effective. Modifications include the removal of the Sequence Listing in Application (paper), which will result in a reduction in burden associated with this information collection. The USPTO estimates that this information collection's annual burden will decrease by 5,000 responses and 30,000 burden hours. These burden estimates are based on the current OMB approved burdens (response volumes) associated with this information collection, which may be different from any forecasts mentioned in other parts of this proposed rule. The changes discussed in this proposed rule do not affect the information collection requirements or burdens associated with 0651- 0031, 0651-0032 and 0651-0064 listed above; therefore, the USPTO does not plan to take any additional actions on these information collections as a result of this rulemaking. Notwithstanding any other provision of law, no person is required to respond to, nor shall a person be subject to a penalty for failure to comply with, a collection of information subject to the requirements of the Paperwork Reduction Act unless that collection of information has a currently valid OMB control number. P. E-Government Act Compliance: The USPTO is committed to compliance with the E-Government Act to promote the use of the internet and other information technologies, to provide increased opportunities for citizen access to Government information and services, and for other purposes. List of Subjects in 37 CFR Part 1 Administrative practice and procedure, Biologics, Courts, Freedom of information, Incorporation by reference, Inventions and patents, Reporting and recordkeeping requirements, Small businesses. For the reasons stated in the preamble and under the authority contained in 35 U.S.C. 2, as amended, the USPTO proposes to further amend 37 CFR part 1 (as proposed to be amended at 86 FR 28301 (May 26, 2021)) as follows: PART 1--RULES OF PRACTICE IN PATENT CASES 0 1. The authority citation for 37 CFR part 1 continues to read as follows: Authority: 35 U.S.C. 2(b)(2), unless otherwise noted. 0 2. Section 1.52 is amended by: 0 a. Revising paragraph (e)(1)(ii); 0 b. Removing the period at the end of paragraph (e)(3)(iii) and adding ``; and'' in its place; 0 c. Adding paragraph (e)(3)(iv); and 0 d. Revising paragraphs (e)(7), (f)(1) introductory text, (f)(1)(i), (f)(2) introductory text, (f)(2)(i), and (f)(3). The revisions and addition read as follows: Sec. 1.52 Language, paper, writing, margins, read-only optical disc specifications. * * * * * (e) * * * (1) * * * (ii) A ``Sequence Listing'' (submitted under Sec. 1.821(c) in compliance with Sec. 1.824) or a ``Sequence Listing XML'' (submitted under Sec. 1.831(a) in compliance with Sec. Sec. 1.832 through 1.834); or * * * * * (3) * * * (iv) The contents of each read-only optical disc for a ``Sequence Listing XML'' must be in XML file format, and if compressed, must be compressed in accordance with Sec. 1.834. * * * * * (7) Any amendment to the information on a read-only optical disc must be by way of a replacement read-only optical disc, in compliance with Sec. 1.58(g) for ``Large Tables,'' Sec. 1.96(c)(5) for a ``Computer Program Listing Appendix,'' Sec. 1.825(b) for a ``Sequence Listing'' or Computer Readable Form (CRF) of a ``Sequence Listing,'' and Sec. 1.835(b) for a ``Sequence Listing XML.'' * * * * * (f) * * * (1) Submission on read-only optical discs. The application size fee required by Sec. 1.16(s) or Sec. 1.492(j), for an application component submitted in part on a read-only optical disc in compliance with paragraph (e) of this section, shall be determined such that each three kilobytes of content submitted on a read-only optical disc shall be counted as a sheet of paper. Excluded from this determination is any ASCII plain text file or any XML file (as applicable) submitted on a read-only optical disc under paragraph (e) of this section containing: (i) Any ``Sequence Listing'' or CRF of a ``Sequence Listing'' in compliance with Sec. 1.821(c) or (e), or any ``Sequence Listing XML'' in compliance with Sec. 1.831(a); or * * * * * (2) Submission via the USPTO patent electronic filing system. The application size fee required by Sec. 1.16(s) or Sec. 1.492(j), for an application submitted in whole or in part via the USPTO patent electronic filing system, shall be determined such that the paper size equivalent will be considered to be 75% of the number of sheets of paper present in the specification and drawings of the application when entered into the Office records after being rendered by the USPTO patent electronic filing system. Excluded from this determination is any ASCII plain text file or any XML file (as applicable) submitted via the USPTO patent electronic filing system containing: (i) Any ``Sequence Listing'' or CRF of a ``Sequence Listing,'' in compliance with Sec. 1.821(c) or (e) or any ``Sequence Listing XML'' in compliance with Sec. 1.831(a); or * * * * * (3) Oversized submission. Any submission of a ``Sequence Listing'' in electronic form or a ``Sequence Listing XML'' of 300 MB-800 MB filed in an application under 35 U.S.C. 111 or 371 will be subject to the fee set forth in Sec. 1.21(o)(1). Any submission of a ``Sequence Listing'' in electronic form or a ``Sequence Listing XML'' that exceeds 800 MB filed in an application under 35 U.S.C. 111 or 371 will be subject to the fee set forth in Sec. 1.21(o)(2). 0 3. Section 1.53 is amended by revising paragraph (c)(4) to read as follows: Sec. 1.53 Application number, filing date, and completion of application. * * * * * [[Page 35439]] (c) * * * (4) A provisional application is not entitled to the right of priority under 35 U.S.C. 119, 365(a), or 386(a) or Sec. 1.55, or to the benefit of an earlier filing date under 35 U.S.C. 120, 121, 365(c), or 386(c) or Sec. 1.78 of any other application. No claim for priority under 35 U.S.C. 119(e) or Sec. 1.78(a) may be made in a design application based on a provisional application. A provisional application disclosing nucleotide and/or amino acid sequences is not required to include a separate sequence listing; however, if submitted in a provisional application filed on or after January 1, 2022, any submission of biological sequence data must be a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.834. * * * * * 0 4. Section 1.77 is amended by revising paragraph (b)(5) to read as follows: Sec. 1.77 Arrangement of application elements. * * * * * (b) * * * (5) An incorporation by reference statement regarding the material on the: (i) One or more ASCII plain text files, submitted via the USPTO patent electronic filing system or on one or more read-only optical discs (see Sec. 1.52(e)(8)), identifying the names of each file, the date of creation of each file, and the size of each file in bytes, for the following document types: (A) A ``Computer Program Listing Appendix'' (see Sec. 1.96(c)); (B) A ``Sequence Listing'' (see Sec. 1.821(c)); or (C) ``Large Tables'' (see Sec. 1.58(c)). (ii) eXtensible Markup Language (XML) file of the Sequence Listing (``Sequence Listing XML''), submitted via an USPTO patent electronic filing system or on one or more read-only optical discs (see Sec. 1.52(e)(8)), identifying the names of each file, the date of creation of each file, and the size of each file in bytes (Sec. 1.831(a)). * * * * * 0 5. Section 1.121 is amended by revising paragraphs (b) introductory text and (b)(6) read as follows: Sec. 1.121 Manner of making amendments in applications. * * * * * (b) Specification. Amendments to the specification, other than the claims, ``Large Tables'' (Sec. 1.58(c)), a ``Computer Program Listing Appendix'' (Sec. 1.96(c)(5) and (7)), a ``Sequence Listing'' or CRF (Sec. 1.825), or ``Sequence Listing XML''s (Sec. 1.835), must be made by adding, deleting, or replacing a paragraph, by replacing a section, or by a substitute specification, in the manner specified in this section. * * * * * (6) ``Large Tables,'' a ``Computer Program Listing Appendix,'' a ``Sequence Listing,'' or a ``Sequence Listing XML.'' Changes to ``Large Tables,'' a ``Computer Program Listing Appendix,'' a ``Sequence Listing,'' or a ``Sequence Listing XML'' must be made in accordance with Sec. 1.58(g) for ``Large Tables,'' Sec. 1.96(c)(5) for a ``Computer Program Listing Appendix,'' Sec. 1.825 for a ``Sequence Listing,'' and Sec. 1.835 for a ``Sequence Listing XML.'' * * * * * 0 6. Section 1.173 is amended by revising paragraphs (b)(1) and (d) introductory text to read as follows: Sec. 1.173 Reissue specification, drawings, and amendments. * * * * * (b) * * * (1) Specification other than the claims, ``Large Tables'' (Sec. 1.58(c)), a ``Computer Program Listing Appendix'' (Sec. 1.96(c)), a ``Sequence Listing'' (Sec. 1.821(c)) or a ``Sequence Listing XML'' (Sec. 1.831(a)). (i) Changes to the specification, other than to the claims, ``Large Tables'' (Sec. 1.58(c)), a ``Computer Program Listing Appendix'' (Sec. 1.96(c)), a ``Sequence Listing'' (Sec. 1.821(c)) or a ``Sequence Listing XML'' (Sec. 1.831(a)), must be made by submission of the entire text of an added or rewritten paragraph, including markings pursuant to paragraph (d) of this section, except that an entire paragraph may be deleted by a statement deleting the paragraph, without presentation of the text of the paragraph. The precise point in the specification where any added or rewritten paragraph is located must be identified. (ii) Changes to ``Large Tables,'' a ``Computer Program Listing Appendix,'' a ``Sequence Listing,'' or a ``Sequence Listing XML'' must be made in accordance with Sec. 1.58(g) for ``Large Tables,'' Sec. 1.96(c)(5) for a ``Computer Program Listing Appendix,'' Sec. 1.825 for a ``Sequence Listing,'' and Sec. 1.835 for a ``Sequence Listing XML''. * * * * * (d) Changes shown by markings. Any changes relative to the patent being reissued that are made to the specification, including the claims but excluding ``Large Tables,'' a ``Computer Program Listing Appendix,'' a ``Sequence Listing,'' or a ``Sequence Listing XML'', upon filing or by an amendment paper in the reissue application, must include the following markings: * * * * * 0 7. Section 1.211 is amended by revising paragraph (c) to read as follows; Sec. 1.211 Publication of applications. * * * * * (c) An application filed under 35 U.S.C. 111(a) will not be published until it includes the basic filing fee (Sec. 1.16(a) or (c)) and any English translation required by Sec. 1.52(d). The Office may delay publishing any application until it includes any application size fee required by the Office under Sec. 1.16(s) or Sec. 1.492(j), a specification having papers in compliance with Sec. 1.52 and an abstract (Sec. 1.72(b)), drawings in compliance with Sec. 1.84, a ``Sequence Listing'' in compliance with Sec. Sec. 1.821 through 1.825 (if applicable) for an application filed before January 1, 2022, a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable) for an application filed on or after January 1, 2022, and the inventor's oath or declaration or application data sheet containing the information specified in Sec. 1.63(b). * * * * * 0 8. Section 1.495 is amended by revising paragraph (c)(5) to read as follows: Sec. 1.495 Entering the national stage in the United States of America. * * * * * (c) * * * (5) Translations of a ``Sequence Listing:'' For international applications having an international filing date before January 1, 2022, a ``Sequence Listing'' need not be translated if the ``Sequence Listing'' complies with PCT Rule 12.1(d) and the description complies with PCT Rule 5.2(b). For international applications having an international filing date on or after January 1, 2022, for purposes of paragraph (c)(1)(i) of this section, an English translation is required for any ``Sequence Listing'' in XML format containing non-English language values for the invention title/and or any language-dependent free text qualifiers in accordance with Sec. Sec. 1.831 through 1.834. * * * * * 0 9. Section 1.530 is amended by revising paragraph (d)(1) to read as follows: Sec. 1.530 Statement by patent owner in ex parte reexamination; amendment by patent owner in ex parte or inter partes reexamination; inventorship change in ex parte or inter partes reexamination. * * * * * (d) * * * (1) Specification other than the claims, ``Large Tables'' (Sec. 1.58(c)), a [[Page 35440]] ``Computer Program Listing Appendix'' (Sec. 1.96(c)), a ``Sequence Listing'' (Sec. 1.821(c)) or a ``Sequence Listing XML (Sec. 1.831(a)). (i) Changes to the specification, other than to the claims, ``Large Tables'' (Sec. 1.58(c)), a ``Computer Program Listing Appendix'' (Sec. 1.96(c)), a ``Sequence Listing'' (Sec. 1.821(c)), or a ``Sequence Listing XML'' (Sec. 1.831(a)), must be made by submission of the entire text of an added or rewritten paragraph, including markings pursuant to paragraph (f) of this section, except that an entire paragraph may be deleted by a statement deleting the paragraph, without presentation of the text of the paragraph. The precise point in the specification where any added or rewritten paragraph is located must be identified. (ii) Changes to ``Large Tables,'' a ``Computer Program Listing Appendix,'' a ``Sequence Listing,'' or a ``Sequence Listing XML'' must be made, in accordance with Sec. 1.58(g) for ``Large Tables,'' Sec. 1.96(c)(5) for a ``Computer Program Listing Appendix,'' Sec. 1.825 for a ``Sequence Listing,'' and Sec. 1.835 for a ``Sequence Listing XML.'' * * * * * 0 10. Section 1.704 is amended by revising paragraph (f) to read as follows: Sec. 1.704 Reduction of period of adjustment of patent term. * * * * * (f) An application filed under 35 U.S.C. 111(a) is in condition for examination when the application includes a specification, including at least one claim and an abstract (Sec. 1.72(b)), and has papers in compliance with Sec. 1.52, drawings (if any) in compliance with Sec. 1.84, any English translation required by Sec. 1.52(d) or Sec. 1.57(a), a sequence listing in compliance with Sec. Sec. 1.821 through 1.825 (if applicable), a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable), an inventor's oath or declaration or an application data sheet containing the information specified in Sec. 1.63(b), the basic filing fee (Sec. 1.16(a) or (c)), the search fee (Sec. 1.16(k) or (m)), the examination fee (Sec. 1.16(o) or (q)), any certified copy of the previously filed application required by Sec. 1.57(a), and any application size fee required by the Office under Sec. 1.16(s). An international application is in condition for examination when the application has entered the national stage as defined in Sec. 1.491(b), and includes a specification, including at least one claim and an abstract (Sec. 1.72(b)), and has papers in compliance with Sec. 1.52, drawings (if any) in compliance with Sec. 1.84, a sequence listing in compliance with Sec. Sec. 1.821 through 1.825 (if applicable), a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable), the inventor's oath or declaration or an application data sheet containing the information specified in Sec. 1.63(b), the search fee (Sec. 1.492(b)), the examination fee (Sec. 1.492(c)), and any application size fee required by the Office under Sec. 1.492(j). An application shall be considered as having papers in compliance with Sec. 1.52, drawings (if any) in compliance with Sec. 1.84, and a sequence listing in compliance with Sec. Sec. 1.821 through 1.825 (if applicable) or a ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.835 (if applicable), for purposes of this paragraph (f) on the filing date of the latest reply (if any) correcting the papers, drawings, or sequence listing that is prior to the date of mailing of either an action under 35 U.S.C. 132 or a notice of allowance under 35 U.S.C. 151, whichever occurs first. 0 11. Sections 1.831 through 1.835 and 1.839 are added to read as follows: Sec. 1.831 Requirements for patent applications filed on or after January 1, 2022, having nucleotide and/or amino acid sequence disclosures. 1.832 Representation of nucleotide and/or amino acid sequence data in the ``Sequence Listing XML'' part of a patent application filed on or after January 1, 2022. 1.833 Requirements for a ``Sequence Listing XML'' for nucleotide and/or amino acid sequences as part of a patent application filed on or after January 1, 2022. 1.834 Form and format for nucleotide and/or amino acid sequence submissions as the ``Sequence Listing XML'' in patent applications filed on or after January 1, 2022. 1.835 Amendment to add or replace a ``Sequence Listing XML'' in patent applications filed on or after January 1, 2022. 1.839 Incorporation by reference. * * * * * Sec. 1.831 Requirements for patent applications filed on or after January 1, 2022, having nucleotide and/or amino acid sequence disclosures. (a) Patent applications disclosing nucleotide and/or amino acid sequences by enumeration of their residues, as defined in paragraph (b) of this section, must contain, as a separate part of the disclosure, a computer readable Sequence Listing in XML (eXtensible Markup Language) format (a ``Sequence Listing XML''). Disclosed nucleotide or amino acid sequences that do not meet the definition of paragraph (b) of this section must not be included in the ``Sequence Listing XML.'' The ``Sequence Listing XML'' contains the sequence information of the nucleotides and/or amino acids disclosed in the patent application using the symbols and format in accordance with the requirements of Sec. Sec. 1.832 through 1.834. (b) Nucleotide and/or amino acid sequences as used in Sec. Sec. 1.831 through 1.835, encompass: (1) An unbranched sequence or linear region of a branched sequence containing 4 or more specifically defined amino acids, wherein the amino acids form a single peptide backbone; or (2) An unbranched sequence or linear region of a branched sequence of 10 or more specifically defined nucleotides, wherein adjacent nucleotides are joined by: (i) A 3' to 5' (or 5' to 3') phosphodiester linkage; or (ii) Any chemical bond that results in an arrangement of adjacent nucleobases that mimics the arrangement of nucleobases in naturally occurring nucleic acids, (i.e., nucleotide analogs). (c) Where the description or claims of a patent application discuss a sequence that is set forth in the ``Sequence Listing XML'' in accordance with paragraph (a) of this section, reference must be made to the sequence by use of the sequence identifier, preceded by SEQ ID NO: Or the like in the text of the description or claims, even if the sequence is also embedded in the text of the description or claims of the patent application. (d) ``Enumeration of its residues'' means disclosure of a nucleotide or amino acid sequence in a patent application by listing, in order, each residue of the sequence, where the residues are represented in the manner as defined in WIPO Standard ST.26 (2020) (incorporated by reference, see Sec. 1.839), paragraph 3(c)(i) or (ii). (e) ``Specifically defined'' means any amino acid or nucleotide as defined in WIPO Standard ST.26 (2020), paragraph 3(m). (f) ``Amino acid'' includes any D- or L-amino acid or modified amino acid as defined in WIPO Standard ST.26 (2020), paragraph 3(a). (g) ``Modified amino acid'' includes any amino acid as described in WIPO Standard ST.26 (2020), paragraph 3(g). (h) ``Nucleotide'' includes any nucleotide, nucleotide analog or modified nucleotide as defined in WIPO Standard ST.26 (2020), paragraphs 3(h) and 3(i). (i) ``Modified nucleotide'' includes any nucleotide as described in WIPO Standard ST.26 (2020), paragraph 3(h). [[Page 35441]] Sec. 1.832 Representation of nucleotide and/or amino acid sequence data in the ``Sequence Listing XML'' part of a patent application filed on or after January 1, 2022. (a) Each disclosed nucleotide or amino acid sequence that meets the requirements of Sec. 1.831(b) must appear separately in the ``Sequence Listing XML''. Each sequence set forth in the ``Sequence Listing XML'' must be assigned a separate sequence identifier. The sequence identifiers must begin with 1 and increase sequentially by integers as defined in WIPO Standard ST.26 (2020) (incorporated by reference, see Sec. 1.839), paragraph 10. (b) The representation and symbols for nucleotide sequence data shall conform to the requirements of paragraphs (b)(1) through (4) of this section. (1) A nucleotide sequence must be represented in the manner described in WIPO Standard ST.26 (2020), paragraphs 11-12. (2) All nucleotides, including nucleotide analogs, modified nucleotides, and ``unknown'' nucleotides, within a nucleotide sequence must be represented using the symbols set forth in WIPO Standard ST.26 (2020), paragraphs 13-16, 19 and 21. (3) Modified nucleotides within a nucleotide sequence must be described in the manner discussed in WIPO Standard ST.26 (2020), paragraphs 17-18, and 19. (4) A region containing a known number of contiguous ``a'', ``c'', ``g'', ``t'', or ``n'' residues for which the same description applies may be jointly described in the manner described in WIPO Standard ST.26 (2020), paragraph 22. (c) The representation and symbols for amino acid sequence data shall conform to the requirements of paragraphs (c)(1) through (4) of this section. (1) The amino acids in an amino acid sequence must be represented in the manner described in WIPO Standard ST.26 (2020), paragraphs 24- 25. (2) All amino acids, including modified amino acids and ``unknown'' amino acids, within an amino acid sequence must be represented using the symbols set forth in WIPO Standard ST.26 (2020), paragraphs 26-29 and 32. (3) Modified amino acid within an amino acid sequence must be described in the manner discussed in WIPO Standard ST.26 (2020), paragraphs 29 and 30. (4) A region containing a known number of contiguous ``X'' residues for which the same description applies may be jointly described in the manner described in WIPO Standard ST.26 (2020), paragraph 34. (d) A nucleotide and/or amino acid sequence that is constructed as a single continuous sequence derived from one or more non-contiguous segments of a larger sequence or from segments of different sequences must be listed in a sequence listing in the manner described in WIPO Standard ST.26 (2020), paragraph 35. (e) A nucleotide and/or amino acid sequence that contains regions of specifically defined residues separated by one or more regions of contiguous ``n'' or ``X'' residues, wherein the exact number of ``n'' or ``X'' residues in each region is disclosed, must be listed in a sequence listing in the manner described in WIPO Standard ST.26 (2020), paragraph 36. (f) A nucleotide and/or amino acid sequence that contains regions of specifically defined residues separated by one or more gaps of an unknown or undisclosed number of residues must be listed in a sequence listing in the manner described in WIPO Standard ST.26 (2020), paragraph 37. Sec. 1.833 Requirements for a ``Sequence Listing XML'' for nucleotide and/or amino acid sequences as part of a patent application filed on or after January 1, 2022. (a) The ``Sequence Listing XML'' as required by Sec. 1.831(a) must be presented as a single file in XML 1.0 encoded using Unicode UTF-8 where the character set complies with WIPO Standard ST.26 (2020) (incorporated by reference, see Sec. 1.839), paragraphs 40 and 41 and Annex IV thereof. (b) The ``Sequence Listing XML'' as required by Sec. 1.833(a) must: (1) Be valid according to the Document Type Definition (DTD) as presented in Annex II of WIPO Standard ST.26 (2020). (2) Comply with the requirements of WIPO Standard ST.26 (2020) to include: (i) An XML declaration as defined in WIPO Standard ST.26 (2020), paragraph 39; (ii) A document type (DOCTYPE) declaration as defined in WIPO Standard ST.26 (2020), paragraph 39; (iii) A root element as defined in WIPO Standard ST.26 (2020), paragraph 43; (iv) A general information part that complies with the requirements of WIPO Standard ST.26 (2020), paragraphs 45, 47 and 48, as applicable; and (v) A sequence data part that complies with the requirements of WIPO Standard ST.26 (2020), paragraphs 50-55, 57-58, 60-69, 71-78, 80- 87, 89-98 and 100, as applicable. (3) Include one InventionTitle element in the English language, in the format required by WIPO Standard ST.26 (2020), paragraphs 45 and 48, and as required by Sec. 1.52(b)(1)(ii). (4) Include an INSDQualifier_value element with a value in the English language for any language-dependent free text qualifier as defined by WIPO Standard ST.26 (2020), paragraphs 76 and 85-88, and as required by Sec. 1.52(b)(1)(ii). Sec. 1.834 Form and format for nucleotide and/or amino acid sequence submissions as the ``Sequence Listing XML'' in patent applications filed on or after January 1, 2022. (a) A ``Sequence Listing XML'' encoded using Unicode UTF-8, created by any means (e.g., text editors, nucleotide/amino acid sequence editors, or other custom computer programs) in accordance with Sec. Sec. 1.831 through 1.833, must: (1) Have the following compatibilities: (i) Computer compatibility: PC or Mac[supreg]; and (ii) Operating system compatibility: MS-DOS[supreg], MS- Windows[supreg], Mac OS[supreg], or Unix[supreg]/Linux[supreg]. (2) Be in XML format, where all permitted printable characters (including the space character) and non-printable (control) characters are defined in WIPO Standard ST.26 (2020) (incorporated by reference, see Sec. 1.839), paragraph 40. (3) Be named as *.xml, where ``*'' is one character or a combination of characters limited to upper- or lowercase letters, numbers, hyphens, and underscores and the name does not exceed 60 characters in total, excluding the extension. No spaces or other types of characters are permitted in the file name. (b) The ``Sequence Listing XML'' must be in a single file containing the sequence information and be submitted either: (1) Electronically via the USPTO patent electronic filing system, where the file size must not exceed 100 MB, and file compression is not permitted; or (2) On read-only optical disc(s) in compliance with Sec. 1.52(e), where: (i) A file that is not compressed must be contained on a single read-only optical disc; (ii) The file may be compressed using WinZip[supreg], 7-Zip, or Unix[supreg]/Linux[supreg] Zip; (iii) A compressed file must not be self-extracting; and (iv) A compressed XML file that does not fit on a single read-only optical disc may be split into multiple file parts, in accordance with the target read-only [[Page 35442]] optical disc size, and labeled in compliance with Sec. 1.52(e)(5)(vi). Sec. 1.835 Amendment to add or replace a ``Sequence Listing XML'' in patent applications filed on or after January 1, 2022. (a) Any amendment to a patent application adding an initial submission of a ``Sequence Listing XML'' as required by Sec. 1.831(a) after the application filing date must include: (1) A ``Sequence Listing XML'' in accordance with Sec. Sec. 1.831 through 1.834, submitted as an XML file: (i) Via the USPTO patent electronic filing system; or (ii) On a read-only optical disc, in compliance with Sec. 1.52(e); (2) A request to amend the specification to include an incorporation by reference statement of the material in the ``Sequence Listing XML'' file, identifying the name of the file, the date of creation, and the size of the file in bytes (see Sec. 1.77(b)(5)(ii)), except when submitted to the United States International Preliminary Examining Authority for an international application; (3) A statement that indicates the basis for the amendment, with specific references to particular parts of the application as originally filed (specification, claims, drawings) for all sequence data in the ``Sequence Listing XML;'' and (4) A statement that the ``Sequence Listing XML'' includes no new matter. (b) Any amendment adding to, deleting from, or replacing sequence information in a ``Sequence Listing XML'' submitted as required by Sec. 1.831(a) must include: (1) A replacement ``Sequence Listing XML'' in accordance with the requirements of Sec. Sec. 1.831 through 1.834 containing the entire ``Sequence Listing XML'' including any additions, deletions, or replacements of sequence information, and shall be submitted: (i) Via the USPTO patent electronic filing system; or (ii) On a read-only optical disc, in compliance with Sec. 1.52(e) labeled as ``REPLACEMENT MM/DD/YYYY'' (with the month, day, and year of creation indicated); (2) A request to amend the specification to include an incorporation by reference statement of the material in the replacement ``Sequence Listing XML'' file that identifies the name of the file, the date of creation, and the size of the file in bytes (see Sec. 1.77(b)(5)(ii)), except when the replacement ``Sequence Listing XML'' is submitted to the United States International Preliminary Examining Authority for an international application; (3) A statement that identifies the location of all additions, deletions, or replacements of sequence information relative to replaced ``Sequence Listing XML;'' (4) A statement that indicates the support for the additions, deletions, or replacements of the sequence information, with specific references to particular parts of the application as originally filed (specification, claims, drawings) for all amended sequence data in the replacement ``Sequence Listing XML;'' and (5) A statement that the replacement ``Sequence Listing XML'' includes no new matter. (c) The specification of a complete application, filed on the application filing date, with a ``Sequence Listing XML'' as required under Sec. 1.831(a), without an incorporation by reference of the material contained in the ``Sequence Listing XML'' file, must be amended to contain a separate paragraph incorporating by reference the material contained in the ``Sequence Listing XML'' file, in accordance with Sec. 1.77(b)(5)(ii), except for international applications. (d)(1) If any of the requirements of Sec. Sec. 1.831 through 1.834 are not satisfied in an application under 35 U.S.C. 111(a) or in a national stage application under 35 U.S.C. 371, the applicant will be notified and given a period of time within which to comply with such requirements in order to prevent abandonment of the application. Subject to paragraph (d)(2) of this section, any amendment to add or replace a ``Sequence Listing XML'' in reply to a requirement under this paragraph (d)(1) must be submitted in accordance with the requirements of paragraphs (a) through (c) of this section. (2) Compliance with paragraphs (a) through (c) of this section is not required for submission of a ``Sequence Listing XML'' that is solely an English translation of a previously submitted ``Sequence Listing XML'' that contains non-English values for the invention title (as per Sec. 1.833(b)(3)) and/or any language-dependent free text elements (as per Sec. 1.833(b)(4)). The required submission will be a translated ``Sequence Listing XML'' in compliance with Sec. Sec. 1.831 through 1.834. Updated values for attributes in the root element (Sec. 1.833(b)(2)(iii)) or elements of the general information part (Sec. 1.833(b)(2)(iv)) are not considered amendments for purposes of complying with paragraphs (a) through (c) of this section. (e) If any of the requirements of Sec. Sec. 1.831 through 1.834 are not satisfied at the time of filing an international application under the PCT where the application is to be searched by the United States International Searching Authority or examined by the United States International Preliminary Examining Authority, the applicant may be sent a notice necessitating compliance with the requirements within a prescribed time period. Under PCT Rule 13ter applicant can provide, in reply to such a requirement or otherwise, a sequence listing which is a ``Sequence Listing XML'' in accordance with Sec. 1.831(a). The ``Sequence Listing XML'' must be accompanied by a statement that the information recorded does not go beyond the disclosure in the international application as filed. It must also be accompanied by the late furnishing fee set forth in Sec. 1.445(a)(5). If the applicant fails to timely provide the required ``Sequence Listing XML,'' the United States International Searching Authority shall search only to the extent that a meaningful search can be performed without the ``Sequence Listing XML,'' and the United States International Preliminary Examining Authority shall examine only to the extent that a meaningful examination can be performed without the ``Sequence Listing XML.'' (f) Any appropriate amendments to the ``Sequence Listing XML'' in a patent (e.g., by reason of reissue, reexamination, or certificate of correction) must comply with the requirements of paragraph (b) of this section. Sec. 1.839 Incorporation by reference. (a) Certain material is incorporated by reference into this subpart with the approval of the approval of the Director of the Federal Register under 5 U.S.C. 552(a) and 1 CFR part 51. All approved material is available for inspection at The United States Patent and Trademark Office, Office of Patent Legal Administration, 571-272-7701, and from the sources listed elsewhere in this section. It is also available for inspection at the National Archives and Records Administration (NARA). For information on the availability of this material at NARA, email [email protected] or go to www.archives.gov/federal-register/cfr/ibr-locations.html. (b) World Intellectual Property Organization (WIPO); 34 chemin des Colombettes; 1211 Geneva 20 Switzerland, www.wipo.int. (1) WIPO Standard ST.26 (2020). WIPO Handbook on Industrial Property Information and Documentation, Standard ST.26: Recommended [[Page 35443]] Standard for the Presentation of Nucleotide and Amino Acid Sequence Listings Using XML (eXtensible Markup Language) (2020), including Annexes I-VII (www.wipo.int/export/sites/www/standards/en/pdf/03-26-01.pdf); IBR approved for Sec. Sec. 1.831 through 1.834. (2) [Reserved] Andrew Hirshfeld, Commissioner for Patents, Performing the Functions and Duties of the Under Secretary of Commerce for Intellectual Property and Director of the United States Patent and Trademark Office. [FR Doc. 2021-14325 Filed 7-2-21; 8:45 am] BILLING CODE 3510-16-P