Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UnicodeEncodeError: 'charmap' codec can't encode character '\ufffd': character maps to <undefined> in generate_html_notices_from_json.py #219

Open
alandtse opened this issue Jun 29, 2022 · 1 comment

Comments

@alandtse
Copy link

On Windows, I'm getting an encoding error when generating a html report where the input copyright data may have bad encodings. On my MacOS device, the same code does not generate the error.

PS C:\Users\1000183352\hub-rest-api-python> python3 examples/generate_html_notices_report_from_json/generate_html_notices_from_json.py notices_report.json Notices.html
Traceback (most recent call last):
  File "C:\Users\1000183352\hub-rest-api-python\examples\generate_html_notices_report_from_json\generate_html_notices_from_json.py", line 21, in <module>
    fh.write(template.render(componentLicenses=fileContent['componentLicenses'],
  File "C:\Program Files\WindowsApps\PythonSoftwareFoundation.Python.3.10_3.10.1520.0_x64__qbz5n2kfra8p0\lib\encodings\cp1252.py", line 19, in encode
    return codecs.charmap_encode(input,self.errors,encoding_table)[0]
UnicodeEncodeError: 'charmap' codec can't encode character '\ufffd' in position 474947: character maps to <undefined>
PS C:\Users\1000183352\hub-rest-api-python>

The specific cause is the fh.write here:

This is tied to the copyright data picked up in the copyright scan. In this case, it's this line in the raw json.

                     "Copyright \ufffd 2001-2008 Python Software Foundation. Copyright",

which when parsed by template.render() and is converted to .

Copyright � 2001-2008 Python Software Foundation.

This is being discovered in the following component in the CopyrightTexts section of notices.json.

               {
                  "componentVersionSummary": {
                     "projectName": "Python programming language",
                     "versionName": "2.7.5"
                  },
                  "originFullName": "centos:python27-python/2.7.5-7.el6.centos.alt/x86_64",
                  "copyrightTexts": [
                     "Copyright (C) 2005-2010   Gregory P. Smith (greg@krypto.org)\n",
                     "Copyright (C) 2006 - 2010  Gregor Lingl\n# email: glingl@aon.at",
                     "Copyright (c) 2004  Anthony Green\n   \n   FR-V Assembly glue.\n",
                     "Copyright Disney Enterprises, Inc.  All Rights Reserved",
                     "Copyright (c) 1999-2009 by Secret Labs AB.  All rights reserved",
                     "Copyright (C) 1995-2005 Jean-loup Gailly and Mark Adler",
                     "Copyright (C) 1994 Steen Lumholt.",
                     "COPYRIGHT SIGN\n    0x00aa: 0x00d1,     #  FEMININE ORDINAL INDICATOR\n    0x00ab: 0x00ae,     #  LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    0x00ac: 0x00aa,     #  NOT SIGN\n    0x00ad: 0x00f0,     # ",
                     "Copyright (c) 1999 by Fredrik Lundh\n *\n * By obtaining, using, and/or copying this software and/or its\n * associated documentation, you agree that you have read, understood,\n * and will comply with th",
                     "Copyright (C) 2004, 2005, 2007, 2009 Free Software Foundation, Inc.\n#   Written by Scott James Remnant, 2004.",
                     "Copyright (c) 1997 by Fredrik Lundh",
                     "Copyright (c) 2004 by Secret Labs AB, http://www.pythonware.com\n",
                     "Copyright 1995-2005 Mark Adler \";\n/*\n  If you use the zlib library in a product, an acknowledgment is welcome\n  in the documentation of your product. If for some reason you cannot\n  include such an ac",
                     "Copyright (c) 1996  Red Hat, Inc.\n                  \n   Common internal definitions and macros. Only necessary for building\n   libffi.\n",
                     "Copyright (c) 2012 Tsukasa Oi\n",
                     "Copyright (c) 1999 Toby Dickenson\n *\n * Permission to use this software in any way is granted without\n * fee, provided that the copyright notice above appears in all\n * copies. This software is provid",
                     "copyright(self): pass",
                     "Copyright (C) 1999-2012 Free Software Foundation, Inc.",
                     "Copyright (c) 2003 Python Software Foundation.  All rights reserved",
                     "Copyright (c) 1998, 2008 Red Hat, Inc.\n",
                     "Copyright (c) 2000 John Hornkvist\n",
                     "copyright credits\n",
                     "Copyright (C) 2002 Gregory P. Smith",
                     "COPYRIGHT SIGN\n    u'\\u03a3'   #  0xAA -> GREEK CAPITAL LETTER SIGMA\n    u'\\u03aa'   #  0xAB -> GREEK CAPITAL LETTER IOTA WITH DIALYTIKA\n    u'\\xa7'     #  0xAC -> SECTION SIGN\n    u'\\u2260'   #  0xAD",
                     "(C)\n        self.check_basic_callback(create_function)\n        self.check_basic_callback(create_bound_method)\n        self.check_basic_callback(create_unbound_method)",
                     "Copyright 1998 by Just van Rossum, Den Haag, The Netherlands.",
                     "Copyright (C) 2006-2012 Free Software Foundation, Inc.",
                     "Copyright to show\n",
                     "Copyright (C) 2003 Python Software Foundation",
                     "Copyright (C) 2000  Bastian Kleineidam\n",
                     "copyright(self,arg):",
                     "Copyright (C) 2000  Luke Kenneth Casson Leighton <lkcl@samba.org>",
                     "Copyright (c) 1996-2004  Red Hat, Inc.\n",
                     "Copyright (c) 1998-2002 John Aycock\n",
                     "Copyright (c) 2011 Tilera Corp.",
                     "(C) 2003 Martin v. Loewis\n# See \"FOO\" in comments refers to MSDN sections with the title FOO.\nimport msilib, schema, sequence, os, glob, time, re, shutil, zipfile\nfrom msilib import Feature, CAB, Dire",
                     "Copyright (c) 1999-2001, Digital Creations, Fredericksburg, VA, USA\n  and Andrew Kuchling. All rights reserved",
                     "Copyright (c) 2002, 2003, 2004, 2010,\n   Free Software Foundation, Inc. \n   based on ppc_closure.S",
                     "(C) debugger.  The same is true of\n * the purify_stop_here_internal() function so it isn't exported either.\n * And purify_stop_here() should never be called directly.\n *\n * The header file says purify",
                     "copyright -- copyright notice pertaining to this interpreter\\n\\\nplatform -- platform identifier\\n\\\nexecutable -- absolute path of the executable binary of the Python interpreter\\n\\\nprefix -- prefix us",
                     "Copyright (C) 1995-2005 Jean-loup Gailly.\n * For conditions of distribution and use, see copyright notice in zlib.h\n *\n * Compile this file with -DNO_GZCOMPRESS to avoid the compression code.\n",
                     "Copyright (c) 2003, 2004, 2006, 2007, 2012 Kaz Kojima\n",
                     "Copyright (c) 2004 Python Software Foundation.\n   All rights reserved",
                     "copyright(self):\n        # 'copyright' and 'credits' should be in __builtin__\n        self.assertTrue(hasattr(__builtin__, \"copyright\"))\n        self.assertTrue(hasattr(__builtin__, \"credits\"))",
                     "Copyright\n__copyright__ = \"\"\"\\\n",
                     "Copyright (c) 1999-2002 by Secret Labs AB\n",
                     "Copyright (c) 1998 Geoffrey Keating\n",
                     "Copyright (c) 1998  Red Hat, Inc.\n   \n   ARM Foreign Function Interface \n",
                     "Copyright (c) 2012 Alan Hourihane\n",
                     "Copyright (C) 1999, 2002 Aladdin Enterprises.  All rights reserved",
                     "Copyright (c) 1999-2000 by Secret Labs AB",
                     "COPYRIGHT SIGN\n    u'\\xa7'     #  0xB5 -> SECTION SIGN\n    u'\\xb6'     #  0xB6 -> PILCROW SIGN\n    u'\\xbc'     #  0xB7 -> VULGAR FRACTION ONE QUARTER\n    u'\\xbd'     #  0xB8 -> VULGAR FRACTION ONE HAL",
                     "copyright: 2007-2010 by Georg Brandl.\n",
                     "Copyright (C) 2004-2006 Python Software Foundation\n# Authors: Baxter, Wouters and Warsaw\n# Contact: email-sig@python.org",
                     "Copyright (C) 2001-2013 Vinay Sajip. All Rights Reserved",
                     "Copyright \ufffd 2001-2008 Python Software Foundation. Copyright",
                     "Copyright (c) 2003-2005 by Peter Astrand <astrand@lysator.liu.se>\n",
                     "Copyright (c) 2010, Plausible Labs Cooperative, Inc.\n#  \n#  ARM Trampoline Page Generator\n",
                     "Copyright (c) 1995-2001 Corporation for National Research Initiatives.\nAll rights reserved",
                     "Copyright (c) 2008 Guido U. Draheim <guidod@gmx.de>\n",
                     "Copyright (C) 2001 Python Software Foundation\n# Barry Warsaw <barry@python.org>, 2000.",
                     "Copyright (c) 1996,1998,2001-2003,2005,2008,2010  Red Hat, Inc.\n   \n   X86 Foreign Function Interface \n",
                     "Copyright (C) 2002-2006 Python Software Foundation\n# Author: Ben Gertzfield\n# Contact: email-sig@python.org",
                     "Copyright (c) 2002  Ranjit Mathew\n\t\t\t\n",
                     "Copyright (c) 1998, 1999 Thai Open Source Software Center Ltd\nSee the file COPYING for copying permission.\n*/\n",
                     "Copyright (c) 2009  Daniel Witte\n\t\t\t\n",
                     "(C)\n            self.assertTrue(issubclass(C, B))",
                     "Copyright 2009 Gabriel A. Genellina",
                     "Copyright (C) 2004-2012 Free Software Foundation, Inc.",
                     "Copyright (c) 1991, 2000, 2001 by Lucent Technologies.\n *\n * Permission to use, copy, modify, and distribute this software for any\n * purpose without fee is hereby granted, provided that this entire n",
                     "Copyright (c) 2004 by Fredrik Lundh <fredrik@pythonware.com>\n",
                     "Copyright (c) 2010, Plausible Labs Cooperative, Inc.\n",
                     "Copyright (c) 1998 Cygnus Solutions\n",
                     "Copyright (c) 2002  Ranjit Mathew\n",
                     "Copyright 2001-2012 by Vinay Sajip. All Rights Reserved",
                     "Copyright 1995-1996 by Fred L. Drake, Jr. and Virginia Polytechnic\n *  Institute and State University, Blacksburg, Virginia, USA.\n",
                     "Copyright (C) 1997, 2002, 2003 Martin von Loewis",
                     "Copyright (c) 1996-2012  Anthony Green, Red Hat, Inc and others.\n",
                     "COPYRIGHT SIGN\n    u'\\u037a'   #  0xAA -> GREEK YPOGEGRAMMENI\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #  0xAD -> SOFT H",
                     "Copyright 2000, Mojam Media, Inc., all rights reserved",
                     "Copyright (c) 2004, 2010 Free Software Foundation, Inc.",
                     "Copyright (c) 2004, 2005, 2006 Python Software Foundation.\n   All rights reserved",
                     "Copyright (c) 2000 Doug White, 2006 James Knight, 2007 Christian Heimes\n * All rights reserved",
                     "Copyright (c) 2008  Bj\n\t\n   X86 Foreign Function Interface for FreeBSD\n",
                     "copyright}\n  \\gdef\n  \\gdef^^ab{\\guillemetleft}\n  \\gdef^^ac{$\\lnot$}\n  \\gdef^^ad{\\-}\n  \\gdef^^ae{\\registeredsymbol}\n  \\gdef^^af{\\={}}\n  %\n  \\gdef^^b0{\\textdegree}\n  \\gdef^^b1{$\\pm$}\n  \\gdef^^b2{$^2$}\n ",
                     "Copyright (c), 2000-2006, eGenix.com Software GmbH (info@egenix.com)\nSee the documentation for further information on copyrights,\nor contact the author. All Rights Reserved",
                     "Copyright (c) 2000-2013 Python Software Foundation.\nAll rights reserved",
                     "Copyright \ufffd 1995-2001 CNRI. Copyright\n            VALUE \"OriginalFilename\", PYTHON_DLL_NAME \"\\0\"\n            VALUE \"ProductName\", \"Python\\0\"\n            VALUE \"ProductVersion\", PYTHON_VERSION\n        ",
                     "Copyright (C) 2005-2007 Gerhard H",
                     "Copyright (c) 2003-2010 Python Software Foundation\nThis module is free software, and you may redistribute it and/or modify\n",
                     "(c) Copyright CNRI, All Rights Reserved",
                     "Copyright (C) 2004-2010 Gerhard H\n *\n * This file is part of pysqlite.\n",
                     "copyright = _Printer(\"copyright\", sys.copyright)\n    if sys.platform[:4] == 'java':\n",
                     "(C)\n        self.assertIn((\n        self.assertIn(('c', 'method', C), attrs, 'missing plain method')\n        self.assertIn(('p', 'property', A), attrs, 'missing property')\n        self.assertIn(('m', ",
                     "Copyright (c) 2012, 2013  Anthony Green\n   \n   Moxie Assembly glue.\n",
                     "Copyright (c) 2007 by Christian Heimes <christian@cheimes.de>\n",
                     "Copyright (C) 2001,2002 Python Software Foundation\n# csv package unit tests",
                     "Copyright (C) 1997-2012 Free Software Foundation, Inc.",
                     "Copyright (c) 1999-2008 by Fredrik Lundh.  All rights reserved",
                     "(c) Copyright Marc-Andre Lemburg, 2005.\n",
                     "Copyright (c) 2006-2008, R Oudkerk --- see COPYING.txt\n */\n",
                     "Copyright (c) 2001-2006 Twisted Matrix Laboratories.\n",
                     "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006 Expat maintainers.\n",
                     "Copyright (C) 2002-2012 Free Software Foundation, Inc.",
                     "Copyright (C) 2007 Free Software Foundation, Inc.",
                     "COPYRIGHT SIGN\n    0x00a9: 0x00ae,     #  REGISTERED SIGN\n    0x00aa: 0x00ac,     #  NOT SIGN\n    0x00ab: 0x00bd,     #  VULGAR FRACTION ONE HALF\n    0x00ac: 0x00bc,     #  VULGAR FRACTION ONE QUARTER",
                     "Copyright (C) 2004-2010 Gerhard H",
                     "Copyright (c) 1996, 1998, 2001, 2002  Red Hat, Inc.\n",
                     "Copyright (c) 2002  Bo Thorsen\n",
                     "Copyright (C) 1995-2005 Mark Adler\n * For conditions of distribution and use, see copyright notice in zlib.h\n *\n * Thanks to Rodney Brown <rbrown64@csc.com.au> for his contribution of faster\n * CRC me",
                     "COPYRIGHT SIGN\n    u'\\x1a'     #  0xFC -> SUBSTITUTE\n    u'\\x1a'     #  0xFD -> SUBSTITUTE\n    u'\\xbb'     #  0xFE -> RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\x9f'     #  0xFF -> CONTROL\n)\n",
                     "Copyright (c) 1999-2009 by Fredrik Lundh",
                     "Copyright (C) 2008  Free Software Foundation, Inc.",
                     "Copyright(self):\n",
                     "COPYRIGHT SIGN\n    0x00ab: 0x00ae,     #  LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    0x00ac: 0x0089,     #  NOT SIGN\n    0x00ad: 0x00f0,     #  SOFT HYPHEN\n    0x00b0: 0x00f8,     #  DEGREE SIGN\n  ",
                     "Copyright (c) 2011 Plausible Labs Cooperative, Inc.\n",
                     "Copyright (C) 2005 Gerhard H",
                     "Copyright (c) 2000 BeOpen.com.\\n\\\nAll Rights Reserved",
                     "Copyright (c) 2011  Anthony Green\n",
                     "Copyright (c) 2000, 2003, 2004, 2008 Red Hat, Inc.\n   \n   SPARC 64-bit Foreign Function Interface \n",
                     "Copyright()));\n    SET_SYS_FROM_STRING(\"platform\",\n                        PyString_FromString(Py_GetPlatform()));\n    SET_SYS_FROM_STRING(\"executable\",\n                        PyString_FromString(Py_",
                     "Copyright (c) 2008 Anthony Green\n   \n   SuperH SHmedia Foreign Function Interface \n",
                     "Copyright (c) 2004 Simon Posnjak\n",
                     "Copyright (c) 2000, BeOpen.com.\n",
                     "copyright others, see the AUTHORS file included with this\ndistribution.",
                     "Copyright (c) 2002  Bo Thorsen <bo@suse.de>",
                     "Copyright (C) 2011, 2012  Anthony Green\n",
                     "Copyright (c) 2006-2008, R Oudkerk\n# All rights reserved",
                     "Copyright (c) 2002, 2009 Free Software Foundation, Inc.\n   based on darwin.S by John Hornkvist",
                     "Copyright 1995-2005 Jean-loup Gailly \";\n/*\n  If you use the zlib library in a product, an acknowledgment is welcome\n  in the documentation of your product. If for some reason you cannot\n  include such",
                     "Copyright (c) 1999 by Secret Labs AB\n",
                     "Copyright (C) 2001-2012 Vinay Sajip. All Rights Reserved",
                     "Copyright (c) 1996, 1998  Red Hat, Inc.\n   \n   Predefined ffi_types needed by libffi.\n",
                     "Copyright (c) 2002  Roger Sayle\n",
                     "Copyright (C) 1995-2005 Jean-loup Gailly.\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 2001-2013 Python Software Foundation.\\n\\\nAll Rights Reserved",
                     "Copyright (c) 2013 Imagination Technologies Ltd.",
                     "Copyright (c) 2003-2007 Python Software Foundation.\n   All rights reserved",
                     "COPYRIGHT SIGN\n    u'\\xae'     #  0x00a9 -> REGISTERED SIGN\n    u'\\xac'     #  0x00aa -> NOT SIGN\n    u'\\xbd'     #  0x00ab -> VULGAR FRACTION ONE HALF\n    u'\\xbc'     #  0x00ac -> VULGAR FRACTION ONE",
                     "Copyright (C) 2002-2006 Python Software Foundation\n# Author: Ben Gertzfield, Barry Warsaw\n# Contact: email-sig@python.org",
                     "Copyright (c) 2012 Anthony Green\n",
                     "COPYRIGHT = 0x01\nDVD_STRUCT_DISCKEY = 0x02\nDVD_STRUCT_BCA = 0x03\nDVD_STRUCT_MANUFACT = 0x04\nDVD_LAYERS = 4\nDVD_LU_SEND_AGID = 0\nDVD_HOST_SEND_CHALLENGE = 1\nDVD_LU_SEND_KEY1 = 2\nDVD_LU_SEND_CHALLENGE =",
                     "Copyright (c) 2003 Jakub Jelinek <jakub@redhat.com>\n",
                     "Copyright (c) 1996.                           \\\n   |          The Regents of the University of California.                 |\n   |                        All rights reserved",
                     "Copyright (C) 1995-2003 Mark Adler\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 2002 Peter O\n",
                     "Copyright 1992-1994, David Gottner\n *\n *                    All Rights Reserved",
                     "Copyright (c) 2002, 2003, 2009 Free Software Foundation, Inc.\n   based on darwin_closure.S",
                     "Copyright (C) 1994 X Consortium\n",
                     "Copyright (C) 1986 Gary S. Brown.  You may use this program, or\n code or tables extracted from it, as desired without restriction.",
                     "Copyright (c) 1991-1995 Stichting Mathematisch Centrum, Amsterdam.\\n\\\nAll Rights Reserved",
                     "Copyright (c) 1998, 1999 Thai Open Source Software Center Ltd\n   See the file COPYING for copying permission.\n*/\n",
                     "Copyright (c) 1999-2003 Steve Purcell\n",
                     "Copyright (c) 1998-2001 by Secret Labs AB.  All rights reserved",
                     "Copyright = Button(py_button_f, text=\n                                 highlightbackground=self.bg,\n",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Anthony Baxter\n# Contact: email-sig@python.org",
                     "Copyright (c) 2003-2004 by Fredrik Lundh.  All rights reserved",
                     "Copyright (C) 2004, 2005, 2007, 2008 Free Software Foundation, Inc.\n# Written by Gary V. Vaughan, 2004",
                     "Copyright (c) Corporation for National Research Initiatives.",
                     "Copyright (c) 1996  Red Hat, Inc.",
                     "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006 Python Software Foundation.\n# All rights reserved",
                     "(C) from getting tp_clear\n        # The result was a segfault (C.__mro__ was NULL when the callback\n        # tried to look up self.me).",
                     "Copyright 1994 by Lance Ellinghouse,\n *    Cathedral City, California Republic, United States of America.\n *\n *   Version 1.5b1, heavily extended for ncurses by Oliver Andrich:\n",
                     "Copyright (C) 2003-2004 Federico Di Gregorio <fog@debian.org>\n",
                     "Copyright (C) 2002 Gregory P. Smith\n",
                     "Copyright (c) 1996, 1998, 2007  Red Hat, Inc.\n",
                     "(C)/!b go\n\t:more\n\t/\\./!{\n\t  N\n\t  s/\\n# / /\n\t  b more\n",
                     "Copyright (c) 2013 Imagination Technologies Ltd.\n",
                     "copyright: Copyright 2008 by Armin Ronacher.\n",
                     "Copyright (c) 2009  Bradley Smith <brad@brad-smith.co.uk>\n",
                     "Copyright Howard Jones <ha.jones@ic.ac.uk>, September 1994.",
                     "Copyright (c) 2012, 2013  Anthony Green\n",
                     "Copyright 1996 by Sam Rushing",
                     "Copyright 2000 by Timothy O",
                     "copyright as Python attribute, not global\n * 2001-04-28 fl  added __copy__ methods (work in progress)\n * 2001-05-14 fl  fixes for 1.5.2 compatibility\n * 2001-07-01 fl  added BIGCHARSET support (from M",
                     "Copyright (c) 2013 Tensilica, Inc.",
                     "Copyright (c) 2011 Anthony Green\n",
                     "Copyright (C) 1995, 1996, 1997, 1998, and 1999 WIDE Project.\n * All rights reserved",
                     "Copyright (c) 2002  Bo Thorsen <bo@suse.de>\n   \n   x86-64 Foreign Function Interface \n",
                     "Copyright 2006 Google, Inc. All Rights Reserved",
                     "Copyright (c) 2008 Matteo Frigo\n",
                     "Copyright = FOUR_CHAR_CODE(\nkUserDataTextCreationDate = FOUR_CHAR_CODE('\\xa9day')\nkUserDataTextDescription = FOUR_CHAR_CODE('\\xa9des')\nkUserDataTextDirector = FOUR_CHAR_CODE('\\xa9dir')\nkUserDataTextDi",
                     "Copyright (c) 1990-1995, Stichting Mathematisch Centrum.\n# All rights reserved",
                     "Copyright (C) 2004  Anthony Green\n",
                     "Copyright (C) 2008, 2010  Free Software Foundation, Inc.",
                     "Copyright (C) 1996-2012 Free Software Foundation, Inc.",
                     "Copyright (c) 2011 Anthony Green\n   \n   IA64 Foreign Function Interface \n",
                     "Copyright (C) 2000, 2001 by Autonomous Zone Industries\n",
                     "Copyright (C) 2006-2007 Gerhard H",
                     "Copyright)\n        buttonCopyright.pack(side=LEFT, padx=10, pady=10)\n",
                     "Copyright assigned to the Python Software Foundation, 2001.\n */\n",
                     "Copyright (c) 2011 Free Software Foundation\n",
                     "Copyright (C) 2001-2010 Python Software Foundation\n# Author: Barry Warsaw\n# Contact: email-sig@python.org",
                     "copyright\n  \\definedummyword\\registeredsymbol\n  \\definedummyword\\dots\n  \\definedummyword\\enddots\n  \\definedummyword\\entrybreak\n  \\definedummyword\\equiv\n  \\definedummyword\\error\n  \\definedummyword\\euro",
                     "(c) Copyright 2005, Marc-Andre Lemburg (mal@lemburg.com).\n",
                     "Copyright (c) 2008 Red Hat, Inc.\n",
                     "Copyright 2001-2013 by Vinay Sajip. All Rights Reserved",
                     "Copyright (C) 1995-2005 Jean-loup Gailly\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (C) 2011 Anthony Green\n",
                     "Copyright (c) 2002, 2003, 2004, Free Software Foundation,\n   Inc. based on ppc_closure.S",
                     "copyright \\memberline \\backslash \\pi \\centerline\n    \\caption \\vspace \\textwidth \\menuselection \\textless\n    \\makevar \\csimplemacro \\menuselection \\bfcode \\sub \\release\n    \\email \\kwindex \\refexmodi",
                     "Copyright (c) 1991-1995 Stichting Mathematisch Centrum.\nAll rights reserved",
                     "COPYRIGHT SIGN\n    0x00b9: 0x2563,     #  BOX DRAWINGS DOUBLE VERTICAL AND LEFT\n    0x00ba: 0x2551,     #  BOX DRAWINGS DOUBLE VERTICAL\n    0x00bb: 0x2557,     #  BOX DRAWINGS DOUBLE DOWN AND LEFT\n   ",
                     "Copyright (c) 1996, 1998, 1999, 2001, 2007, 2008  Red Hat, Inc.\n",
                     "Copyright (c) 1997-2001 by Secret Labs AB.  All rights reserved",
                     "Copyright (c) 2013  The Written Word, Inc.\n",
                     "COPYRIGHT SIGN\n    u'\\u044e'   #  0xC0 -> CYRILLIC SMALL LETTER YU\n    u'\\u0430'   #  0xC1 -> CYRILLIC SMALL LETTER A\n    u'\\u0431'   #  0xC2 -> CYRILLIC SMALL LETTER BE\n    u'\\u0446'   #  0xC3 -> CYR",
                     "Copyright (C) 2011 Free Software Foundation, Inc.\nThis config.lt script is free software; the Free Software Foundation\ngives unlimited permision to copy, distribute and modify it.\"",
                     "Copyright (C) 2007, 2008, 2010 Free Software Foundation, Inc\n",
                     "Copyright (c) 1998 Geoffrey Keating",
                     "Copyright (c) 1998, 2008, 2011 Red Hat, Inc.\n",
                     "Copyright (c) 1993 George V. Neville-Neil\n * All rights reserved",
                     "COPYRIGHT SIGN\n    u'\\u2122'   #  0xAA -> TRADE MARK SIGN\n    u'\\xb4'     #  0xAB -> ACUTE ACCENT\n    u'\\xa8'     #  0xAC -> DIAERESIS\n    u'\\u2260'   #  0xAD -> NOT EQUAL TO\n    u'\\u0102'   #  0xAE -",
                     "Copyright (C) 2004 Python Software Foundation\n",
                     "Copyright (c) 2012 Tilera Corp.\n",
                     "(c) Copyright Guido van Rossum, 2000.",
                     "copyright      show copyright\n  --examples       show examples of usage",
                     "(C) 1995-2004 Jean-loup Gailly and Mark Adler",
                     "Copyright (c) 2008 by Christian Heimes <christian@cheimes.de>\n",
                     "Copyright (c) 2002-2008, 2012 Kaz Kojima\n",
                     "Copyright (c) 2002  Roger Sayle\n   \n   x86 Foreign Function Interface \n",
                     "copyright 1991-1995 by Stichting Mathematisch\\n\\\nCentrum, Amsterdam, The Netherlands.\";",
                     "COPYRIGHT SIGN\n    u'\\xaa'     #  0xAA -> FEMININE ORDINAL INDICATOR\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #  0xAD ->",
                     "copyright\n * (C) 2000 Peter Bosch, all rights reserved",
                     "Copyright (C) 2001-2012 Free Software Foundation, Inc.",
                     "Copyright (C) 1995-2004 Jean-loup Gailly\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (C) 1995-2004 Jean-loup Gailly.\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 2003-2009 by Fredrik Lundh.  All rights reserved",
                     "Copyright(void);\nPyAPI_FUNC(const char *) Py_GetCompiler(void);\nPyAPI_FUNC(const char *) Py_GetBuildInfo(void);\nPyAPI_FUNC(const char *) _Py_svnversion(void);\nPyAPI_FUNC(const char *) Py_SubversionRev",
                     "Copyright (c) 1999-2001, Digital Creations, Fredericksburg, VA, USA\n#  and Andrew Kuchling. All rights reserved",
                     "Copyright (c) 2002  Python Software Foundation; All Rights Reserved",
                     "(C)  # should pass silently\n        self.assertRaises(RuntimeError, C.register, A)  # cycles not allowed\n        C.register(B)  # ok",
                     "Copyright (c) 1998 The Open Group\n",
                     "Copyright (c) 2013 Imagination Technologies\n",
                     "Copyright 1994 by Lance Ellinghouse,\nCathedral City, California Republic, United States of America.",
                     "Copyright 1995-1997, Automatrix, Inc., all rights reserved",
                     "Copyright (c) 1996 Red Hat, Inc.\n",
                     "Copyright (c) 1995-2001 Corporation for National Research Initiatives.\\n\\\nAll Rights Reserved",
                     "COPYRIGHT SIGN\n    0x00aa: 0x00a6,     #  FEMININE ORDINAL INDICATOR\n    0x00ab: 0x00ae,     #  LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    0x00ac: 0x00aa,     #  NOT SIGN\n    0x00ad: 0x00f0,     # ",
                     "Copyright 1994 by Lance Ellinghouse\n# Cathedral City, California Republic, United States of America.\n#                        All Rights Reserved",
                     "copyright()\n    sethelper()\n    aliasmbcs()\n    setencoding()\n    execsitecustomize()\n    if ENABLE_USER_SITE:\n",
                     "COPYRIGHT SIGN\n    u'\\xd7'     #  0xAA -> MULTIPLICATION SIGN\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #  0xAD -> SOFT H",
                     "Copyright 1999, Bioreason, Inc., all rights reserved",
                     "Copyright (c) 2009  Bradley Smith <brad@brad-smith.co.uk>",
                     "Copyright 2006 Georg Brandl.\n",
                     "Copyright (c) 2002-2006 Python Software Foundation.  All rights reserved",
                     "(C) isn\n        # objects reachable from the dying object (c1) isn't enough to stop\n",
                     "Copyright (c) 1996, 1998, 2005  Red Hat, Inc.\n   \n   MIPS Foreign Function Interface \n",
                     "Copyright (c) 2003, 2004, 2006, 2008 Kaz Kojima\n   \n   SuperH SHmedia Foreign Function Interface \n",
                     "Copyright (c) 1998, 2012 Andreas Schwab\n",
                     "Copyright (c) 2008, 2010  Red Hat, Inc.\n",
                     "Copyright (c) 2010 Rhys Ulerich <rhys.ulerich@gmail.com>\n",
                     "copyright 1991-1995 by Stichting Mathematisch Centrum,\n *  Amsterdam, The Netherlands.  Copying is permitted under the terms\n *  associated with the main Python distribution, with the additional\n *  r",
                     "Copyright (c) 2002 Jorge Acereda  <jacereda@users.sourceforge.net>\n                   Peter O'Gorman <ogorman@users.sourceforge.net>\n                   \n",
                     "COPYRIGHT SIGN\n    u'\\ufffe'   #  0xAA -> UNDEFINED\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #  0xAD -> SOFT HYPHEN\n    ",
                     "Copyright (c) 1999-2000, Marc-Andre Lemburg; mailto:mal@lemburg.com\n",
                     "(C)\n        self.assertEqual(o.__dict__,\n        del o\n        o = new.instance(C, None)\n        self.assertEqual(o.__dict__, {}, \"new __dict__ should be empty\")\n        del o",
                     "Copyright (c) 2008  David Daney\n",
                     "COPYRIGHT SIGN\n    u'\\u015e'   #  0xAA -> LATIN CAPITAL LETTER S WITH CEDILLA\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #",
                     "Copyright (C) 1995-2004 Mark Adler\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 1995-2000, Corporation for National Research Initiatives.\n",
                     "Copyright (c) 2000 Hewlett Packard Company\n",
                     "(C) 2003 Martin v. Loewis",
                     "Copyright (C) 1997-1998 Greg Stein. All Rights Reserved",
                     "Copyright (C) 2005, 2006 Martin von L\n",
                     "Copyright 1996,1997 by Oliver Andrich, Koblenz, Germany.\n *\n *   Tidied for Python 1.6, and currently maintained by <amk@amk.ca>.\n",
                     "Copyright (c) 2012  Anthony Green\n",
                     "COPYRIGHT SIGN\n    u'\\u2122'   #  0xAA -> TRADE MARK SIGN\n    u'\\xb4'     #  0xAB -> ACUTE ACCENT\n    u'\\xa8'     #  0xAC -> DIAERESIS\n    u'\\u2260'   #  0xAD -> NOT EQUAL TO\n    u'\\xc6'     #  0xAE -",
                     "Copyright (c) 1990-1995, Stichting Mathematisch Centrum.\nAll rights reserved",
                     "Copyright (C) 1993 by Sun Microsystems, Inc. All rights reserved",
                     "Copyright (C) 2002-2007 Python Software Foundation\n# Contact: email-sig@python.org",
                     "Copyright (c) 1996, 1998, 2001, 2002, 2009  Red Hat, Inc.\n",
                     "copyright sign, U\n    'crarr':    0x21b5, # downwards arrow with corner leftwards = carriage return, U+21B5 NEW\n    'cup':      0x222a, # union = cup, U+222A ISOtech\n    'curren':   0x00a4, # currency",
                     "Copyright (C) 2002-2004 Python Software Foundation",
                     "Copyright (c) 2002, 2007  Bo Thorsen <bo@suse.de>\n             \n   x86-64 Foreign Function Interface \n",
                     "copyright = __copyright__",
                     "Copyright (c) 2001  John Beniton\n",
                     "Copyright (c) 2008 Red Hat, Inc.",
                     "Copyright (c) 2008 Steven G. Johnson <stevenj@alum.mit.edu>\n",
                     "Copyright 2004-2005 Elemental Security, Inc. All Rights Reserved",
                     "Copyright (C) 1999, 2000, 2002 Aladdin Enterprises.  All rights reserved",
                     "Copyright (c) 2004 by Peter Astrand <astrand@lysator.liu.se>\n *\n * By obtaining, using, and/or copying this software and/or its\n * associated documentation, you agree that you have read, understood,\n ",
                     "copyright % self.__dict__)\n        print\n        return 0",
                     "COPYRIGHT SIGN\n    u'\\u0156'   #  0xAA -> LATIN CAPITAL LETTER R WITH CEDILLA\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #",
                     "Copyright (C) 2009\n# the Initial Developer. All Rights Reserved",
                     "COPYRIGHT SIGN\n    u'\\u2563'   #  0x00b9 -> BOX DRAWINGS DOUBLE VERTICAL AND LEFT\n    u'\\u2551'   #  0x00ba -> BOX DRAWINGS DOUBLE VERTICAL\n    u'\\u2557'   #  0x00bb -> BOX DRAWINGS DOUBLE DOWN AND LE",
                     "Copyright (c) 1996, 1998, 2005, 2007, 2009, 2010  Red Hat, Inc.\n   \n   MIPS Foreign Function Interface \n",
                     "Copyright (c) 2004  Renesas Technology\n",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Barry Warsaw\n# Contact: email-sig@python.org",
                     "COPYRIGHT SIGN\n    u'\\u1e82'   #  0xAA -> LATIN CAPITAL LETTER W WITH ACUTE\n    u'\\u1e0b'   #  0xAB -> LATIN SMALL LETTER D WITH DOT ABOVE\n    u'\\u1ef2'   #  0xAC -> LATIN CAPITAL LETTER Y WITH GRAVE\n",
                     "Copyright 1991-1995, Stichting Mathematisch Centrum, all rights reserved",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Keith Dart\n# Contact: email-sig@python.org",
                     "Copyright (C) 1995, 1996, 1997, and 1998 WIDE Project.\n * All rights reserved",
                     "Copyright (C) 2002 Python Software Foundation\n# email package unit tests for (optional) Asian codecs",
                     "COPYRIGHT SIGN\n    0x00ab: 0x00ae,     #  LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    0x00ac: 0x00aa,     #  NOT SIGN\n    0x00ad: 0x00f0,     #  SOFT HYPHEN\n    0x00ae: 0x00a9,     #  REGISTERED SIG",
                     "Copyright (c), 1997-2006, Marc-Andre Lemburg (mal@lemburg.com)\n",
                     "Copyright (c) 2012, 2013 Xilinx, Inc",
                     "Copyright (C) 2005   Gregory P. Smith (greg@krypto.org)\n",
                     "Copyright (c) 2008  Red Hat, Inc.\n   \n   M32R Foreign Function Interface \n",
                     "(C)\n     AC_LINK_IFELSE(\n     AC_LANG_POP])\n  if test x\"$lt_cv_cc_needs_belf\" != x\"yes\"; then\n    # this is probably gcc 2.8.0, egcs 1.0 or newer; no need for -belf\n    CFLAGS=\"$SAVE_CFLAGS\"\n  fi\n  ;;",
                     "Copyright 1991-1995 by Stichting Mathematisch Centrum, Amsterdam,\nThe Netherlands.",
                     "Copyright (c) 2004 Python Software Foundation.\n# All rights reserved",
                     "Copyright (C) 2007 Gerhard H",
                     "Copyright (C) 2002, 2003 Python Software Foundation.\n# Written by Greg Ward <gward@python.net>",
                     "Copyright (c) 1996-2003, 2010  Red Hat, Inc.\n",
                     "Copyright (C) 2001,2002 Python Software Foundation, and were\nwritten by Barry Warsaw.\n\"\"\"\n",
                     "Copyright (c) 1999-2009 by Fredrik Lundh.\n *\n * info@pythonware.com\n * http://www.pythonware.com\n",
                     "Copyright (c) 1996, 1998, 2001, 2002, 2003, 2005  Red Hat, Inc.\n",
                     "COPYRIGHT SIGN\n    u'\\u06be'   #  0xAA -> ARABIC LETTER HEH DOACHASHMEE\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     #  0xAD",
                     "Copyright 2007 Georg Brandl.\n",
                     "Copyright (c) 2002 __MyCompanyName__. All rights reserved",
                     "Copyright (C) 2004 Free Software Foundation, Inc.\n#   Written by Scott James Remnant, 2004",
                     "(C)\n        # Trigger cache.\n        C().f()\n        del C\n        test_support.gc_collect()\n        self.assertEqual(r(), None)",
                     "Copyright (c) 1996-2003  Red Hat, Inc.",
                     "COPYRIGHT(s): return __IDSTRING(__CONCAT(__copyright_,__LINE__),s)",
                     "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010,\n",
                     "Copyright (c) 2004 Free Software Foundation, Inc.",
                     "Copyright (C) 2001-2010 Python Software Foundation\n# Contact: email-sig@python.org\n# email package unit tests",
                     "Copyright (c) 1996, 1998  Red Hat, Inc.\n",
                     "Copyright (c) 1998, 2001, 2007, 2008  Red Hat, Inc.\n   \n   Alpha Foreign Function Interface \n",
                     "copyright(self): self.write(self.COPYRIGHT_SYMBOL)\n",
                     "Copyright (c) 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011,\n2012, 2013 Python Software Foundation.  All rights reserved",
                     "Copyright (c) 1999, 2008  Red Hat, Inc.",
                     "Copyright (c) 1999-2002 by Fredrik Lundh.",
                     "Copyright (C) 2008  Red Hat, Inc.\n   \n   FR-V Foreign Function Interface \n",
                     "Copyright (c) 2000, 2007 Software AG\n",
                     "Copyright (c) 2002, 2003, 2004, Free Software Foundation,\n",
                     "Copyright (c) 1998, 2001, 2007, 2008, 2011 Red Hat\n   \n   Alpha/OSF Foreign Function Interface \n",
                     "Copyright (c), 2000-2006, eGenix.com Software GmbH (info@egenix.com)",
                     "Copyright (c) 2004  Renesas Technology.\n",
                     "Copyright (C) 1996, 1997, 1998, 1999, 2000, 2001, 2003, 2004, 2005,\n#                 2006, 2007, 2008, 2009, 2010, 2011 Free Software\n#                 Foundation, Inc.\n#   Written by Gordon Matzigke",
                     "COPYRIGHT SIGN\n    0x0098: 0x038f,     #  GREEK CAPITAL LETTER OMEGA WITH TONOS\n    0x0099: 0x00b2,     #  SUPERSCRIPT TWO\n    0x009a: 0x00b3,     #  SUPERSCRIPT THREE\n    0x009b: 0x03ac,     #  GREEK",
                     "Copyright (c) 2008 Red Hat, Inc.\n   \n   SuperH Foreign Function Interface \n",
                     "copyright message\nand disclaimer are retained in their original form.",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Ben Gertzfield\n# Contact: email-sig@python.org",
                     "Copyright 2008 Armin Ronacher.\n",
                     "Copyright (C) YEAR ORGANIZATION\n# FIRST AUTHOR <EMAIL@ADDRESS>, YEAR.",
                     "Copyright (c) 1998, 2008, 2011  Red Hat, Inc.\n\t   \n   ARM Foreign Function Interface \n",
                     "copyright: 2008, 2009, 2010 by Georg Brandl.\n",
                     "Copyright (C) 1995-2003 Jean-loup Gailly.\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 1996, 1997, 2003, 2004, 2008  Red Hat, Inc.\n   \n   SPARC Foreign Function Interface \n",
                     "COPYRIGHT SIGN\n    u'\\u2122'   #  0xAA -> TRADE MARK SIGN\n    u'\\u0119'   #  0xAB -> LATIN SMALL LETTER E WITH OGONEK\n    u'\\xa8'     #  0xAC -> DIAERESIS\n    u'\\u2260'   #  0xAD -> NOT EQUAL TO\n    u",
                     "Copyright (c) 2007, 2009, 2010  Red Hat, Inc.\n",
                     "copyright string.  This is updated manually.",
                     "Copyright (c) 2006 Free Software Foundation, Inc.\n   derived from unix64.S",
                     "Copyright (c) 2012  Alexandre K. I. de Mendonca <alexandre.keunecke@gmail.com>",
                     "(c) Copyright 2000 Guido van Rossum.",
                     "Copyright (c) 1999 by Fredrik Lundh",
                     "Copyright (c) 2000-2010, eGenix.com Software GmbH; mailto:info@egenix.com",
                     "Copyright (c) 2002, 2003, 2004, 2006, 2008 Kaz Kojima\n   \n   SuperH Foreign Function Interface \n",
                     "Copyright (C) 2007, 2009, 2010 Free Software Foundation, Inc\n",
                     "Copyright (c) 2006 Free Software Foundation, Inc.\n",
                     "Copyright (c) 2000 BeOpen.com.\nAll rights reserved",
                     "Copyright (c) 2012 Thorsten Glaser",
                     "Copyright (c) 1998, 1999, 2000 Thai Open Source Software Center Ltd\n                               and Clark Cooper\n",
                     "Copyright (c) 2008 Red Hat, Inc\n \n   S390 Foreign Function Interface\n",
                     "Copyright (c) 1999, 2007, 2008  Red Hat, Inc.",
                     "Copyright (c) 1991-1995 Stichting Mathematisch Centrum.\n",
                     "COPYRIGHT SIGN\n    u'\\u0218'   #  0xAA -> LATIN CAPITAL LETTER S WITH COMMA BELOW\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\u0179'   #  0xAC -> LATIN CAPITAL LETTER Z",
                     "COPYRIGHT SIGN\n    u'\\u2122'   #  0xAA -> TRADE MARK SIGN\n    u'\\u0402'   #  0xAB -> CYRILLIC CAPITAL LETTER DJE\n    u'\\u0452'   #  0xAC -> CYRILLIC SMALL LETTER DJE\n    u'\\u2260'   #  0xAD -> NOT EQU",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Barry Warsaw, Thomas Wouters, Anthony Baxter\n# Contact: email-sig@python.org",
                     "Copyright (c) 2009 Alan Jenkins <alan-jenkins@tuffmail.co.uk>\n",
                     "Copyright (c) 2011 Maarten Bosmans <mkbosmans@gmail.com>\n",
                     "Copyright (c) 2004 Renesas Technology\n   \n   M32R Foreign Function Interface \n",
                     "COPYRIGHT = 0\nDVD_CPM_COPYRIGHTED = 1\nDVD_CP_SEC_NONE = 0\nDVD_CP_SEC_EXIST = 1\nDVD_CGMS_UNRESTRICTED = 0\nDVD_CGMS_SINGLE = 2\nDVD_CGMS_RESTRICTED = 3",
                     "Copyright(void)\n{\n\treturn cprt;\n",
                     "(C) Copyright 2001  Autonomous Zone Industries\n",
                     "Copyright (c) 2009, 2010, 2011, 2012 ARM Ltd.\n",
                     "Copyright (C) 2000 Autonomous Zone Industries\n",
                     "Copyright (c) 2001-2006 Gregory P. Ward.  All rights reserved",
                     "Copyright (C) 2008 Red Hat, Inc\n",
                     "Copyright (c) 1996, 2007, 2008, 2011  Red Hat, Inc.\n   \n   MIPS Foreign Function Interface \n",
                     "Copyright (C) 2001,2002 Python Software Foundation\n# email package unit tests",
                     "Copyright (C) 2011 Kyle Moffett\n",
                     "Copyright 1995-1996 by Virginia Polytechnic Institute\nUniversity, Blacksburg, Virginia, USA, and Fred L. Drake, Jr., Reston,\\n\\\n",
                     "Copyright (C) 2006 Python Software Foundation.\n",
                     "Copyright (c) 1996, 2003-2004, 2007-2008 Red Hat, Inc.\n   \n   SPARC Foreign Function Interface \n",
                     "Copyright (c) 1998 Red Hat, Inc.\n;  \n;  ARM Foreign Function Interface \n",
                     "Copyright 2000, Clark Cooper\n",
                     "Copyright (c) 2005-2006 ActiveState Software Inc.\n *\n * Namespace all expat exported symbols to avoid dynamic loading symbol\n * collisions when embedding Python.\n",
                     "Copyright (c) 1999 by Secret Labs AB.\n",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# email package unit tests",
                     "Copyright (c) 2003 Python Software Foundation.\n   All rights reserved",
                     "Copyright (c) 1996, 1998, 1999, 2001  Red Hat, Inc.\n",
                     "copyright{copyright\n  \\def\\dots{...}%\n  \\def\\enddots{...}%\n  \\def\\equiv{==}%\n  \\def\\error{error}%\n  \\def\\euro{euro}%\n  \\def\\expansion{==>}%\n  \\def\\geq{>=}%\n  \\def\\guillemetleft{<<}%\n  \\def\\guillemetri",
                     "copyright}\n  \\DeclareUnicodeCharacter\n  \\DeclareUnicodeCharacter{00AB}{\\guillemetleft}\n  \\DeclareUnicodeCharacter{00AD}{\\-}\n  \\DeclareUnicodeCharacter{00AE}{\\registeredsymbol}\n  \\DeclareUnicodeCharact",
                     "Copyright (C) 1996, 1997, 1998, 1999, 2000, 2001, 2003, 2004, 2005, 2006,\n",
                     "Copyright (C) 2002, 2006, 2007, 2009, 2010 Free Software Foundation, Inc.",
                     "Copyright (C) 2007 Free Software Foundation, Inc",
                     "Copyright (C) 2005-2010 Gerhard H\n *\n * This file is part of pysqlite.\n",
                     "Copyright (C) 2002-2006 Python Software Foundation\n# Contact: email-sig@python.org\n# email package unit tests for (optional) Asian codecs",
                     "Copyright (C) 2005 Martin v. L\n",
                     "Copyright (c) 2010  CodeSourcery\n",
                     "Copyright (c) 2011 Plausible Labs Cooperative, Inc.",
                     "Copyright (c) 2011 Timothy Wall\n",
                     "COPYRIGHT SIGN\n    u'\\u2044'   #  0xDA -> FRACTION SLASH\n    u'\\u20ac'   #  0xDB -> EURO SIGN\n    u'\\u2039'   #  0xDC -> SINGLE LEFT-POINTING ANGLE QUOTATION MARK\n    u'\\u203a'   #  0xDD -> SINGLE RIG",
                     "Copyright (c) 1991 - 1995, Stichting Mathematisch Centrum Amsterdam,\nThe Netherlands.  All rights reserved",
                     "Copyright (c) 1999-2008 by Fredrik Lundh",
                     "Copyright (C) 2012, 2013  Anthony Green\n   \n   Moxie Foreign Function Interface \n",
                     "Copyright (c) 2011 Plausible Labs Cooperative, Inc.\n   \n   ARM Foreign Function Interface \n",
                     "Copyright (c) 2012 Tilera Corp.",
                     "Copyright:          Copyright",
                     "Copyright (c) 2013 Tensilica, Inc.\n   \n   XTENSA Foreign Function Interface \n",
                     "Copyright 2007 Google, Inc. All Rights Reserved",
                     "Copyright (c) 2001 Mark Pilgrim",
                     "copyright 2001, Autonomous Zones Industries, Inc., all rights...\n",
                     "Copyright 1985, 1986, 1988, 1990, 1991, 1992, 1993, 1994, 1995,\n",
                     "copyright = '1990-",
                     "Copyright (c) 1999 by Fredrik Lundh.\n *\n * fredrik@pythonware.com\n * http://www.pythonware.com\n",
                     "Copyright (C) 2001 John Hornkvist\n",
                     "COPYRIGHT SIGN\n    u'\\u038f'   #  0x0098 -> GREEK CAPITAL LETTER OMEGA WITH TONOS\n    u'\\xb2'     #  0x0099 -> SUPERSCRIPT TWO\n    u'\\xb3'     #  0x009a -> SUPERSCRIPT THREE\n    u'\\u03ac'   #  0x009b ",
                     "Copyright (c) 1998, 2007, 2008, 2012 Red Hat, Inc.\n",
                     "Copyright (c) 2005 Axis Communications AB\n",
                     "Copyright (C) 2003-2012 Free Software Foundation, Inc.",
                     "COPYRIGHT SIGN\n    u'\\u0404'   #  0xAA -> CYRILLIC CAPITAL LETTER UKRAINIAN IE\n    u'\\xab'     #  0xAB -> LEFT-POINTING DOUBLE ANGLE QUOTATION MARK\n    u'\\xac'     #  0xAC -> NOT SIGN\n    u'\\xad'     ",
                     "Copyright (C) 1997 - 2002, Makoto Matsumoto and Takuji Nishimura,\n   All rights reserved",
                     "Copyright (c) 2006, Marc-Andre Lemburg (mal@egenix.com). See the\n    documentation for further information on copyrights, or contact\n    the author. All Rights Reserved",
                     "Copyright (C) 2004, 2005, 2007, 2008, 2009 Free Software Foundation,\n#   Inc.\n#   Written by Gary V. Vaughan, 2004",
                     "COPYRIGHT SIGN\n    u'\\u2563'   #  0xB9 -> BOX DRAWINGS DOUBLE VERTICAL AND LEFT\n    u'\\u2551'   #  0xBA -> BOX DRAWINGS DOUBLE VERTICAL\n    u'\\u2557'   #  0xBB -> BOX DRAWINGS DOUBLE DOWN AND LEFT\n   ",
                     "Copyright (C) 1998 Geoffrey Keating\n",
                     "Copyright (c) 2005 Axis Communications AB",
                     "Copyright (c) 1997-2002 by Secret Labs AB \";",
                     "Copyright (c) 2008  Red Hat, Inc\n",
                     "Copyright (C) 1999-2006, 2007, 2008, 2011 Free Software Foundation, Inc.\n#   Written by Thomas Tanner, 1999",
                     "Copyright (C) 2006-2010 Gerhard H\n *\n * This file is part of pysqlite.\n",
                     "Copyright (C) 1995-2005 Mark Adler\n * For conditions of distribution and use, see copyright notice in zlib.h\n */\n",
                     "Copyright (c) 2011, 2013 Anthony Green\n",
                     "Copyright (C) 2007, 2008 Free Software Foundation, Inc\n",
                     "Copyright 1991-1997 by Stichting Mathematisch Centrum, Amsterdam,\nThe Netherlands.",
                     "Copyright (c) 1995-2000 Corporation for National Research Initiatives.\nAll rights reserved",
                     "COPYRIGHT = \\\n          'Type \"copyright\", \"credits\" or \"license()\" for more information.",
                     "Copyright (C) 2002 Lars Gust\n",
                     "Copyright (C) 1999-2001 Gregory P. Ward.\n",
                     "Copyright (c) 2009 Guido U. Draheim <guidod@gmx.de>\n",
                     "Copyright (c) 2002  Gustavo Niemeyer <niemeyer@conectiva.com>\n",
                     "copyright string in the executable of your product.\n */\n",
                     "Copyright (c) 2000 Hewlett Packard Company\n   \n   IA64/unix Foreign Function Interface \n",
                     "Copyright (c) 1999-2002 by Fredrik Lundh",
                     "Copyright (c) 1997-2001 by Total Control Software\n#                         All Rights Reserved",
                     "Copyright (C) 2001-2007 Python Software Foundation\n# Contact: email-sig@python.org\n# email package unit tests",
                     "Copyright (C) 2007  Free Software Foundation, Inc\n",
                     "Copyright (c) 1996-2003  Red Hat, Inc.\n",
                     "Copyright (C) 2007  Free Software Foundation, Inc.\n",
                     "Copyright (c) 1998, 1999, 2000 Thai Open Source Software Center Ltd\n   See the file COPYING for copying permission.\n*/\n",
                     "Copyright (c) 2008 Red Hat, Inc.\n   derived from unix64.S",
                     "Copyright (C) 2002-2006 Python Software Foundation\n# Author: Barry Warsaw\n# Contact: email-sig@python.org",
                     "copyright: 2007-2008 by Georg Brandl.\n",
                     "Copyright (c) 1999-2002 by Secret Labs AB.\n",
                     "Copyright (c) 1996, 1998, 2001, 2002, 2003  Red Hat, Inc.\n   \n   X86 Foreign Function Interface \n",
                     "Copyright (C) 2004-2005 Gerhard H",
                     "(C)\n        self.check_basic_ref(create_function)\n        self.check_basic_ref(create_bound_method)\n        self.check_basic_ref(create_unbound_method)",
                     "Copyright (C) 2001-2010 Python Software Foundation\n# Contact: email-sig@python.org",
                     "Copyright (C) 2001-2006 Python Software Foundation\n# Author: Ben Gertzfield, Barry Warsaw\n# Contact: email-sig@python.org",
                     "Copyright (c) 2011, 2012  Anthony Green\n"
                  ],
                  "componentProjectName": "Python programming language"
               },
@jfcherng
Copy link
Contributor

-with open(args.output_file_html, 'w+') as fh:
-    with open(args.json_file, 'r') as lj:
+with open(args.output_file_html, 'w+', encoding='utf-8') as fh:
+    with open(args.json_file, 'r', encoding='utf-8') as lj:

should fix it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants