o Å¸eZ‰ã@sÂddlZddlZddlZddlmZddlmZddlmZm Z zddlmZWne y7ddlmZYnwzddlmZWne yKdZYnwddlmZdd lmZgd ¢Zej ¡Zded<d]dd„Z dd„Ze d¡Zd^dd„Zdd„ZdZdZdZe d_idd“dd“dd“dd“dd“d d“d!d“d"d“d#d“d$d“d%d“d&d“d'd“d(d“d)d“d*d“d+d“d,d“d-d“d.d“d/d“d0e“d1e“d2e“d3e“d4e“ŽZ!de!d5<d6d7„Z"d8d9„Z#d:d;„Z$Gdd?„d?e&ƒZ'Gd@dA„dAe'ƒZ(GdBdC„dCe%ƒZ)e)ƒZ*GdDdE„dEe)ƒZ+e+ƒZ,ZGdFdG„dGe%ƒZ-GdHdI„dIe%ƒZ.GdJdK„dKeƒZ/GdLdM„dMe/ƒZ0GdNdO„dOe/ƒZ1d]dPdQ„Z2d`dRdS„Z3dadTdU„Z4dVdW„Z5e6dXdY„e7dZƒDƒƒZ8d[d\„Z9dS)béN)Úentities)Ú HTMLParser)ÚMarkupÚescape)Úsoft_str)Úsoft_unicode)Ú LazyProxy)Ú TracError)Ú to_unicode)Ú DeuglifierÚFormTokenInjectorÚTracHTMLSanitizerrÚfind_elementÚhtmlÚis_safe_originÚ plaintextÚtagÚto_fragmentÚ stripentitiesÚ striptagsÚvalid_html_bytesÚunescapeé'ÚaposTcCstt|tƒr|St|tƒrt|ƒSt|ƒ}|r&d|vr|Stt|ƒ dd¡ƒSd|vr,|Stt|ƒ dd¡ dd¡ƒS)a3Create a Markup instance from a string and escape special characters it may contain (<, >, & and "). :param text: the string to escape; if not a string, it is assumed that the input can be converted to a string :param quotes: if ``True``, double quote characters are escaped in addition to the other special characters >>> escape('"1 < 2"') Markup('"1 < 2"') >>> escape(['"1 < 2"']) Markup("['"1 < 2"']") If the `quotes` parameter is set to `False`, the " character is left as is. Escaping quotes is generally only required for strings that are to be used in attribute values. >>> escape('"1 < 2"', quotes=False) Markup('"1 < 2"') >>> escape(['"1 < 2"'], quotes=False) Markup('[\'"1 < 2"\']') However, `escape` behaves slightly differently with `Markup` and `Fragment` behave instances, as they are passed through unmodified. >>> escape(Markup('"1 < 2 '"')) Markup('"1 < 2 '"') >>> escape(Markup('"1 < 2 '"'), quotes=False) Markup('"1 < 2 '"') >>> escape(tag.b('"1 < 2"')) Markup('"1 < 2"') >>> escape(tag.b('"1 < 2"'), quotes=False) Markup('"1 < 2"') :return: the escaped `Markup` string :rtype: `Markup` z'ú'zz"ú")Ú isinstancerÚFragmentÚ escape_quotesÚstrÚreplace)ÚtextÚquotesÚe©r$ú0/usr/lib/python3/dist-packages/trac/util/html.pyr3s - rcCs|sdSt|tƒs|S| ¡S)aWReverse-escapes &, <, >, and " and returns a `str` object. >>> unescape(Markup('1 < 2')) '1 < 2' If the provided `text` object is not a `Markup` instance, it is returned unchanged. >>> unescape('1 < 2') '1 < 2' :param text: the text to unescape :return: the unescsaped string :rtype: `str` Ú)rrr©r!r$r$r%rns rz-&(?:#((?:\d+)|(?:[xX][0-9a-fA-F]+));?|(\w+);)Fcs*‡fdd„}t|tƒrt|ƒ}t ||¡S)uŸReturn a copy of the given text with any character or numeric entities replaced by the equivalent UTF-8 characters. >>> stripentities('1 < 2') '1 < 2' >>> stripentities('more …') 'more â€¦' >>> stripentities('…') 'â€¦' >>> stripentities('…') 'â€¦' >>> stripentities(Markup('â€¦')) 'â€¦' If the `keepxmlentities` parameter is provided and is a truth value, the core XML entities (&, ', >, < and ") are left intact. >>> stripentities('1 < 2 …', keepxmlentities=True) '1 < 2 â€¦' :return: a `str` instance with entities removed :rtype: `str` csž| d¡r%| d¡}| d¡rt|dd…dƒ}t|ƒSt|dƒ}t|ƒS| d¡}ˆr4|dvr4d|Sztt|ƒWStyNˆrJd|YS|YSw) Né)ÚxÚXéé é)ÚamprÚgtÚltÚquotz&%s;z&%s;)ÚgroupÚ startswithÚintÚchrÚ_name2codepointÚKeyError)ÚmatchÚref©Úkeepxmlentitiesr$r%Ú_replace_entityžs" ÿ üz&stripentities.._replace_entity)rrrÚ_STRIPENTITIES_REÚsub)r!r;r<r$r:r%r†s rcCst|ƒ ¡S)a½Return a copy of the text with any XML/HTML tags removed. >>> striptags('Foo bar') 'Foo bar' >>> striptags('Foo') 'Foo' >>> striptags('Foo
') 'Foo' HTML/XML comments are stripped, too: >>> striptags('test') 'test' :param text: the string to remove tags from :return: a `str` instance with all tags removed :rtype: `str` )rrr'r$r$r%r¶sr)ÚnoÚyes)ÚoffÚon)ÚfalseÚtrueÚ autofocusÚautoplayÚcheckedÚcontrolsÚdefaultÚdeferÚdisabledÚformnovalidateÚhiddenÚismapÚloopÚmultipleÚmutedÚ novalidateÚopenÚreadonlyÚrequiredÚreversedÚscopedÚseamlessÚselectedÚcontenteditableÚ draggableÚ spellcheckÚ translateÚautocompleteÚasynccCs²|dkrt|tƒrtdi|¤Žpd}n>> classes('foo', 'bar') 'foo bar' In addition, the names of any supplied keyword arguments are added if they have a truth value: >>> classes('foo', bar=True) 'foo bar' >>> classes('foo', bar=False) 'foo' >>> classes(foo=True, bar=True) 'bar foo' If none of the arguments are added to the list, this function returns `''`: >>> classes(bar=False) '' Nc3s|] }ˆ|r|VqdS©Nr$)Ú.0Úk©Úkwargsr$r%Ú s€zclasses..ú )rdÚfilterÚextendÚsortedÚjoin)Úargsrprcr$ror%rcs rccOspi}g}td|ƒD]}t|tƒr| |¡q | |¡q | |¡| dd„t| ¡dd„dDƒ¡d |¡S)ašHelper function for dynamically assembling a list of CSS style name and values in templates. Any positional arguments are added to the list of styles. All positional arguments must be strings or dicts: >>> styles('foo: bar', 'fu: baz', {'bottom-right': '1em'}) 'foo: bar; fu: baz; bottom-right: 1em' In addition, the names of any supplied keyword arguments are added if they have a string value: >>> styles('foo: bar', fu='baz') 'foo: bar; fu: baz' >>> styles('foo: bar', bar=False) 'foo: bar' If none of the arguments are added to the list, this function returns `''`: >>> styles(bar=False) '' Ncss$|] \}}|rd||fVqdS)z%s: %sNr$)rmrnÚvr$r$r%rqDs €þzstyles..cSs|dS)Nrr$)Úir$r$r%ÚEszstyles..)rhú; ) rsrrbÚupdateÚappendrtruÚitemsrv)rwrpÚdreÚargr$r$r%re#s ÿ rec@sLeZdZdZdZdd„Zdd„Zdd„Zd d „Zdd„Z d d„Z dd„ZdS)rz8A fragment represents a sequence of strings or elements.)ÚchildrencGsg|_|D]}| |¡qdSrl)rr}©Úselfrwr€r$r$r%Ú__init__OsÿzFragment.__init__cCstt|ƒƒSrl)rr©rƒr$r$r%Ú__html__TszFragment.__html__cCód dd„|jDƒ¡S)Nr&css|]}t|dƒVqdS)FN©r©rmÚcr$r$r%rqXs€z#Fragment.__str__..©rvrr…r$r$r%Ú__str__WózFragment.__str__cCs t||ƒSrl©r)rƒÚotherr$r$r%Ú__add__Zó zFragment.__add__cGs|D]}| |¡q|Srl)r}r‚r$r$r%Ú__call__]szFragment.__call__cCs€|r2t|tttttfƒr|j |¡dSz |D]}| |¡qWdSty1|j |¡YdSw|dkr>|j d¡dSdS)NrÚ0) rrrÚbytesr4Úfloatrr}Ú TypeError)rƒr€Úeltr$r$r%r}bsÿÿÿzFragment.appendcCr‡)Nr&css*|]}t|tƒr | ¡nt|ƒVqdSrl)rrÚas_textrr‰r$r$r%rqqs€"ÿz#Fragment.as_text..r‹r…r$r$r%r˜ps ÿzFragment.as_textN)Ú__name__Ú __module__Ú__qualname__Ú__doc__Ú __slots__r„r†rŒrr’r}r˜r$r$r$r%rJsrc@sHeZdZdZdZiZdZdZdd„Zdd„Z d d „Z dd„Zd d„ZdS)Ú XMLElementzXAn element represents an XML element, with a tag name, attributes and content. )rÚattribr$z/>cOs<tj|g|¢RŽt|ƒ|_|r| |¡|_dS|j|_dSrl)rr„rrÚ_dict_from_kwargsÚEMPTY_ATTRIBrŸ)rƒrrwrpr$r$r%r„ƒs ÿÿzXMLElement.__init__cCs|Srlr$©rƒrnrxr$r$r%Ú_attr_value‰ózXMLElement._attr_valuecCshg}| ¡D])\}}|dur/|dd…dkr|dd…}| ||¡}|dur/| |t|ƒf¡qt|ƒS)NéÿÿÿÿÚ_)r~r£r}rrb)rƒrpÚattrsrnrxr$r$r%r Œs€zXMLElement._dict_from_kwargscOsD|r| |¡}|r|jr|j |¡n||_|D]}| |¡q|Srl)r rŸr|r})rƒrwrprr€r$r$r%r’—s zXMLElement.__call__cCs d|j}|jr+g}t|jƒD]}|j|}|r!| d||f¡q|r+|d |¡7}|js7|jrI|j|jvrI|dt |¡d|jd7}|S||j 7}|S)Nú<ú %s="%s"r&ú>úÚbrÚhrÚcolÚimgÚwbrÚareaÚbaseÚlinkÚmetaÚembedÚinputÚparamÚtrackÚkeygenÚsourceÚcommandz />r$cCs t||ƒSrl)rkr¢r$r$r%r£Ær‘zElement._attr_valueN)r™ršr›rœr¬rrr£r$r$r$r%r®¶sr®c@ó eZdZdZdd„Zdd„ZdS)ÚXMLElementFactoryzhAn XML element factory can be used to build Fragments and XMLElements for arbitrary tag names. cGst|ŽSrlrŽ)rƒrwr$r$r%r’ÐózXMLElementFactory.__call__cCót|ƒSrl)rž©rƒrr$r$r%Ú__getattr__ÓrÁzXMLElementFactory.__getattr__N)r™ršr›rœr’rÄr$r$r$r%rÀÊsrÀc@seZdZdZdd„ZdS)ÚElementFactoryzaAn element factory can be used to build Fragments and Elements for arbitrary tag names. cCrÂrl)r®rÃr$r$r%rÄÞrÁzElementFactory.__getattr__N)r™ršr›rœrÄr$r$r$r%rÅØsrÅc@sôeZdZdZegd¢ƒZegd¢ƒZegd¢ƒZegd¢ƒZegd¢ƒZ edgƒZ eeeee e fdd „Ze d ¡jZe d¡jZdd „Zdd„Zdd„Zdd„Zdd„Zdd„Ze d¡jZe dej¡jZdd„Zdd„Ze d¡jZdd „Zd!S)"r a³Sanitize HTML constructions which are potentially vector of phishing or XSS attacks, in user-supplied HTML. The usual way to use the sanitizer is to call the `sanitize` method on some potentially unsafe HTML content. See also `genshi.HTMLSanitizer`_ from which the TracHTMLSanitizer has evolved. .. _genshi.HTMLSanitizer: http://genshi.edgewall.org/wiki/Documentation/filters.html#html-sanitizer )GÚaÚabbrÚacronymÚaddressr´ÚbÚbigÚ blockquoter¯ÚbuttonÚcaptionÚcenterÚciteÚcoder±ÚcolgroupÚddÚdelÚdfnÚdirÚdivÚdlÚdtÚemÚfieldsetÚfontÚformÚh1Úh2Úh3Úh4Úh5Úh6r°ryr²r¹ÚinsÚkbdÚlabelÚlegendÚliÚmapÚmenuÚolÚoptgroupÚoptionÚpÚpreÚqÚsÚsampÚselectÚsmallÚspanÚstrikeÚstrongr>ÚsupÚtableÚtbodyÚtdÚtextareaÚtfootÚthÚtheadÚtrÚttÚuÚulÚvar)IrÇÚacceptzaccept-charsetÚ accesskeyÚactionÚalignÚaltÚaxisÚbgcolorÚborderÚcellpaddingÚcellspacingÚcharÚcharoffÚcharsetrGrÐr`ÚclearÚcolsÚcolspanÚcolorÚcompactÚcoordsÚdatetimerÖrKÚenctypeÚforÚframeÚheadersÚheightÚhrefÚhreflangÚhspaceÚidrNræÚlangÚlongdescÚ maxlengthÚmediaÚmethodrPÚnameÚnohrefÚnoshadeÚnowrapÚpromptrTÚrelÚrevÚrowsÚrowspanÚrulesÚscoperYÚshapeÚsizerõÚsrcÚstartraÚsummaryÚtabindexÚtargetÚtitleÚtypeÚusemapÚvalignÚvalueÚvspaceÚwidth)eÚ backgroundzbackground-attachmentzbackground-colorzbackground-imagezbackground-positionzbackground-repeatrz border-bottomzborder-bottom-colorzborder-bottom-stylezborder-bottom-left-radiuszborder-bottom-right-radiuszborder-bottom-widthzborder-collapsezborder-colorzborder-leftzborder-left-colorzborder-left-stylezborder-left-widthz border-radiuszborder-rightzborder-right-colorzborder-right-stylezborder-right-widthzborder-spacingzborder-stylez border-topzborder-top-colorzborder-top-left-radiuszborder-top-right-radiuszborder-top-stylezborder-top-widthzborder-widthÚbottomzcaption-siderÚcliprÚcontentzcounter-incrementz counter-resetÚcursorÚ directionÚdisplayzempty-cellsr•rÜzfont-familyz font-sizez font-stylezfont-variantzfont-weightrÚleftzletter-spacingzline-heightz list-stylezlist-style-imagezlist-style-positionzlist-style-typeÚmarginz margin-bottomzmargin-leftzmargin-rightz margin-topz max-heightz max-widthz min-heightz min-widthÚopacityÚorphansÚoutlinez outline-colorz outline-stylez outline-widthÚoverflowÚpaddingzpadding-bottomzpadding-leftz padding-rightzpadding-topzpage-break-afterzpage-break-beforezpage-break-insideÚpositionr"Úrightztable-layoutz text-alignztext-decorationztext-indentztext-transformÚtopzunicode-bidizvertical-alignÚ visibilityzwhite-spaceÚwidowsr?zword-spacingzz-index)ÚfileÚftpÚhttpÚhttpsÚmailtoN)rr@ÚdynsrcrÚlowsrcr4zdata:cCs(||_||_||_||_||_||_dS)zyNote: safe_schemes and safe_css have to remain the first parameters, for backward-compatibility purpose. N)Ú safe_tagsÚ safe_attrsÚsafe_cssÚ uri_attrsÚsafe_schemesÚsafe_origins)rƒr^r\rZr[r]r_r$r$r%r„3s zTracHTMLSanitizer.__init__uc[eEï¼¥ï½…][xXï¼¸ï½˜][pPï¼°ï½][rRÊ€ï¼²ï½’][eEï¼¥ï½…][sSï¼³ï½“]{2}[iIÉªï¼©ï½‰][oOï¼¯ï½][nNÉ´ï¼®ï½Ž]u[Uu][RrÊ€][LlÊŸ]\s*\(([^)]+)cCs.t|t ¡ƒ}| |¡| ¡t|j ¡ƒS)zÏTransforms the incoming HTML by removing anything's that deemed unsafe. :param html: the input HTML :type: str :return: the sanitized content :rtype: Markup )ÚHTMLSanitizationÚioÚStringIOÚfeedÚcloserÚoutÚgetvalue)rƒrÚ transformr$r$r%Úsanitizeds zTracHTMLSanitizer.sanitizecCs8||jvrdS|dkr| ¡dkS| d¡rd|vSdS)z|Determine whether the given css property declaration is to be considered safe for inclusion in the output. FrNÚstaticrHú-T)r\Úlowerr3)rƒÚpropr=r$r$r%Úis_safe_cssss zTracHTMLSanitizer.is_safe_csscCs&||jvrdS|dkrd|vrdSdS)aODetermine whether the given element should be considered safe for inclusion in the output. :param tag: the tag name of the element :type tag: str :param attrs: the element attributes :type attrs: list :return: whether the element should be considered safe :rtype: bool Fr¹)r:ÚpasswordT)rZ©rƒrr§r$r$r%Úis_safe_elem‚s zTracHTMLSanitizer.is_safe_elemcCsRd|vr| dd¡d}d|vrdSdd„| dd¡dDƒ}d |¡ ¡|jvS) a:Determine whether the given URI is to be considered safe for inclusion in the output. The default implementation checks whether the scheme of the URI is in the set of allowed URIs (`safe_schemes`). >>> sanitizer = TracHTMLSanitizer() >>> sanitizer.is_safe_uri('http://example.org/') True >>> sanitizer.is_safe_uri('javascript:alert(document.cookie)') False :param uri: the URI to check :return: `True` if the URI can be considered safe, `False` otherwise :rtype: `bool` ú#r(rú:TcSsg|]}| ¡r|‘qSr$)Úisalnum)rmrr$r$r%Ú ªsz1TracHTMLSanitizer.is_safe_uri..r&)Úsplitrvrkr^)rƒÚuriÚcharsr$r$r%Úis_safe_uri”szTracHTMLSanitizer.is_safe_uricCsži}| ¡D]1\}}|dur|}||jvrq||jvr"| |¡s!qn|dkr3| |¡}|s.qd |¡}|||<q|dkrMd|vrM| |d¡sMd}d||<|S)a!Remove potentially dangerous attributes and sanitize the style attribute . :param tag: the tag name of the element :type attrs: dict corresponding to tag attributes :return: a dict containing only safe or sanitized attributes :rtype: dict Nrar{r²r4ÚcrossoriginÚ anonymous)r~r[r]rxÚsanitize_cssrvÚ_is_safe_origin)rƒrr§Ú new_attrsÚattrr=Údeclsr$r$r%Úsanitize_attrss, ÿ ÿz TracHTMLSanitizer.sanitize_attrsc s²g}ˆ ˆ |¡¡}td| d¡ƒD]D}| ¡}|sqz | dd¡\}}Wn ty.Yqwˆ | ¡ ¡| ¡¡s>> sanitizer = TracHTMLSanitizer() >>> sanitizer.sanitize_css(''' ... background: url(javascript:alert("foo")); ... color: #000; ... ''') ['color: #000'] Also, the proprietary Internet Explorer function ``expression()`` is always stripped: >>> sanitizer.sanitize_css(''' ... background: #fff; ... color: #000; ... width: e/**/xpression(alert("F")); ... ''') ['background: #fff', 'color: #000', 'width: e xpression(alert("F"))'] :param text: the CSS text; this is expected to be `str` and to not contain any character or numeric references :return: a list of declarations that are considered safe :rtype: `list` Nú;rrr(c3s |]}ˆ | d¡¡VqdS)r(N)r|r2)rmr8r…r$r%rqøs€ÿz1TracHTMLSanitizer.sanitize_css..)Ú_strip_css_commentsÚ_replace_unicode_escapesrsruÚstripÚ ValueErrorrmrkÚ_EXPRESSION_SEARCHÚallÚ _URL_FINDITERr})rƒr!rÚdeclrlr=r$r…r%r{Îs*ÿ ÿÿ€zTracHTMLSanitizer.sanitize_cssz\r\nz8\\([0-9a-fA-F]{1,6})\s?|\\([^\r\n\f0-9a-fA-F'"{};:()#*])cCs| |¡o t|j|ƒSrl)rxrr_)rƒrvr$r$r%r|s ÿz!TracHTMLSanitizer._is_safe_origincCsdd„}| || d|¡¡S)NcSsZ| d¡}|r t|dƒ}t|ƒ}|dkrd}|S|dkrd}|S| d¡}|dkr+dS|S)Nr(r+érrú\z\\r-)r2r4r5)r8ÚtrÑrŠr$r$r%Ú_repls þ z9TracHTMLSanitizer._replace_unicode_escapes.._replÚ )Ú_UNICODE_ESCAPEÚ_NORMALIZE_NEWLINES)rƒr!rr$r$r%rƒs ÿz*TracHTMLSanitizer._replace_unicode_escapesz /\*.*?\*/cCs| d|¡S)z‹Replace comments with space character instead of superclass which removes comments to avoid problems when nested comments. rr)Ú _CSS_COMMENTS)rƒr!r$r$r%r‚sz%TracHTMLSanitizer._strip_css_commentsN) r™ršr›rœÚ frozensetÚ SAFE_TAGSÚ SAFE_ATTRSÚSAFE_CSSÚSAFE_SCHEMESÚ URI_ATTRSÚSAFE_CROSS_ORIGINSr„ÚreÚcompileÚsearchr†Úfinditerrˆrhrmrprxr€r{r>rÚUNICODErr|rƒr‘r‚r$r$r$r%r äsN þÿêÿÿ!/þþr c@s(eZdZdZdd„Zdd„Zdd„ZdS) raÔHelp base class used for cleaning up HTML riddled with ```` tags and replace them with appropriate ````. The subclass must define a `rules()` static method returning a list of regular expression fragments, each defining a capture group in which the name will be reused for the span's class. Two special group names, ``font`` and ``endfont`` are used to emit ```` and ````, respectively. cCs:t |¡}t|dƒst dd | ¡¡¡|_|j|_|S)NÚ_compiled_rulesz(?:%s)ú|)ÚobjectÚ__new__Úhasattrr™ršrvr0rž)Úclsrƒr$r$r%r¡0s zDeuglifier.__new__cCst |j|j|¡Srl)r™r>ržr )rƒÚindatar$r$r%Úformat7ózDeuglifier.formatcCsF| ¡ ¡D]\}}|r |dkrdS|dkrdSd|SqdS)NrÜzÚendfontzz)Ú groupdictr~)rƒÚ fullmatchÚmtyper8r$r$r%r :sûÿzDeuglifier.replaceN)r™ršr›rœr¡r¥r r$r$r$r%r$s rc@óXeZdZdZdd„Zdd„Zdd„Zdd „Zd d„Zdd „Z dd„Z dd„Zdd„ZdS)Ú HTMLTransformzÖConvenience base class for writing HTMLParsers. The default implementation of the HTMLParser ``handle_*`` methods do nothing, while in our case we try to rewrite the incoming document unmodified. cCsRt |¡||_t|tjƒrdd„|_dSt|tjƒr"dd„|_dSdd„|_dS)NcSót|tƒr | d¡S|S©Nzutf-8)rr”Údecode©rxr$r$r%rzQóÿz(HTMLTransform.__init__..cSrr®)rrÚencoder°r$r$r%rzTr±cSs|Srlr$r°r$r$r%rzWs)rr„rerraÚ TextIOBaseÚ_convertÚIOBase)rƒrer$r$r%r„Ms zHTMLTransform.__init__cCó| | ¡¡dSrl©Ú_writeÚget_starttag_textror$r$r%Úhandle_starttagYr¦zHTMLTransform.handle_starttagcCr¶rlr·ror$r$r%Úhandle_startendtag\r¦z HTMLTransform.handle_startendtagcCó| d|¡dS)Nz ©r¸©rƒÚdatar$r$r%Úhandle_comment_r¦zHTMLTransform.handle_commentcCr¼©Nzr½r¾r$r$r%Úhandle_declbr¦zHTMLTransform.handle_declcCr¼)Núr½r¾r$r$r%Ú handle_pier¦zHTMLTransform.handle_picCs| |¡dSrlr½r¾r$r$r%Úhandle_datahszHTMLTransform.handle_datacCs| d|d¡dS©Nr«rªr½rÃr$r$r%Ú handle_endtagkrzHTMLTransform.handle_endtagcCs|j | |¡¡dSrl)reÚwriter´r¾r$r$r%r¸nrzHTMLTransform._writeN) r™ršr›rœr„rºr»rÀrÂrÄrÅrÇr¸r$r$r$r%r¬Dsr¬c@r¿)rzIdentify and protect forms from CSRF attacks. This filter works by adding a input type=hidden field to POST forms. cCst ||¡||_dSrl)r¬r„Útoken)rƒÚ form_tokenrer$r$r%r„ys zFormTokenInjector.__init__cCsZt |||¡| ¡dkr)|D]\}}|dkr(| ¡dkr(| d|j¡dSqdSdS)NrÝr&Úpostz5)r¬rºrkr¸rÉ)rƒrr§r'r=r$r$r%rº}sÿ€ûz!FormTokenInjector.handle_starttagN)r™ršr›rœr„rºr$r$r$r%rrsrc@r«)r`z-Sanitize parsed HTML using TracHTMLSanitizer.cCst ||¡||_d|_dSrl)r¬r„Ú sanitizerÚwaiting_for)rƒrÌrer$r$r%r„‰s zHTMLSanitization.__init__csh|jrdS|j ||¡s||_dS|j |t|ƒ¡‰d ‡fdd„tˆƒDƒ¡}| d|||f¡dS)Nr&c3s$|] }d|tˆ|ƒfVqdS)r©Nrˆ)rmr'©r}r$r%rq–s€ÿz1HTMLSanitization._handle_start..z<%s%s%s>)rÍrÌrpr€rbrvrur¸)rƒrr§ÚstartendÚ html_attrsr$rÎr%Ú _handle_startŽsÿzHTMLSanitization._handle_startcCó|js| ||d¡dSdS)Nr&©rÍrÑror$r$r%rºšóÿz HTMLSanitization.handle_starttagcCrÒ©Nú/rÓror$r$r%r»žrÔz#HTMLSanitization.handle_startendtagcCsdSrlr$r¾r$r$r%rÀ¢r¤zHTMLSanitization.handle_commentcCs|js| d|¡dSdSrÁ©rÍr¸r¾r$r$r%rÂ¥rÔzHTMLSanitization.handle_declcCs$|js| d| dd¡¡dSdS)NrÃz?>r&)rÍr¸r r¾r$r$r%rÄ©sÿzHTMLSanitization.handle_picCs|js| t|ƒ¡dSdSrl)rÍr¸rr¾r$r$r%rÅrÔzHTMLSanitization.handle_datacCs4|jr|j|kr d|_dSdS| d|d¡dSrÆr×rÃr$r$r%rÇ±s ÿzHTMLSanitization.handle_endtagN) r™ršr›rœr„rÑrºr»rÀrÂrÄrÅrÇr$r$r$r%r`†sr`cCs4t|tƒr | ¡}ntt|ƒƒ}|s| dd¡}|S)a^Extract the text elements from (X)HTML content >>> plaintext('1 < 2') '1 < 2' >>> plaintext(tag('1 ', tag.b('<'), ' 2')) '1 < 2' >>> plaintext('''1 ... < ... 2''', keeplinebreaks=False) '1 < 2' :param text: `unicode` or `Fragment` :param keeplinebreaks: optionally keep linebreaks rŽrr)rrr˜rrr )r!Úkeeplinebreaksr$r$r%r¹s rcCs”t|tƒr,|dur||jvr|S|dur!||j dd¡ ¡vr!|S|dur,||jkr,|St|tƒrF|jD]}t||||ƒ}|durE|Sq4dSdS)zReturn the first element in the fragment having the given attribute, class or tag, using a preorder depth-first search. Nr`r&) rr®rŸÚgetrurrrr)Úfragr~r£rÚchildr—r$r$r%rÔs ÿýrcsÊ|rd|vr | d¡s dStdd„|DƒƒrdS| d¡r&|r&d|j|f}t d¡‰‡fdd „}||ƒ}|D]+}||ƒ}||krDdS| d¡rQ| |¡rQdS| | d ¡rZ|n|d ¡rbdSq7dS)z-Whether the given uri is a safe cross-origin.rrz//Tcss|]}|dkVqdS)Ú*Nr$)rmÚsafer$r$r%rqës€z!is_safe_origin..z%s:%sz&(?:[a-zA-Z][-a-zA-Z0-9+._]*:)?//[^/]+$csˆ |¡r |d7}|SrÕ)r8)rv©Únormalize_rer$r%Ú normalize_uriòs z%is_safe_origin..normalize_urirÖF)r3ÚanyÚschemer™ršÚendswith)r_rvÚreqràrÝr$rÞr%rçs& ÿrcCs|t|tƒst|tƒr't|jƒdkr'|jd}t|tƒst|tƒr't|jƒdkstr1t|tƒr1|j}t|tƒr8|Stt |ƒƒS)z%Convert input to a `Fragment` object.r(r) rr Ú ExceptionÚlenrwrr=rrr )r¹r$r$r%rs ÿ þÿ rccs|] }|dvr|VqdS))é r,é Nr$)rmryr$r$r%rqs€ÿÿrqé cCs| dt¡S)z¿Return only valid bytes in XML/HTML from the given data. >>> valid_html_bytes(b'blah') b'blah' >>> list(valid_html_bytes(bytes(range(33)) + b'')) [9, 10, 13, 32, 127] N)r]Ú_invalid_control_chars)r¿r$r$r%rs r)T)Fr$)NNNrl):rar™ÚsysrrÚhtml.parserrÚ markupsaferrrrrÚImportErrorÚ babel.supportrÚ trac.corer Útrac.util.textr Ú__all__Úname2codepointÚcopyr6rršr=rrÚNO_YESÚOFF_ONÚ FALSE_TRUErbrfrkrcrer rržr®rÀÚxmlrÅrr rr¬rr`rrrrr”Úrangerêrr$r$r$r%ÚsÔÿÿ ; 0ÿÿÿÿþþþþþýýýýýüüüüüûûúúúùø %'+A B . 3