P-12-16-19-1

Stendhal
Collection

Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design.  Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance.  The following edits were made to the content to improve the framework:
  1. Words were stemmed.
  2. Stop Words were used.
  • The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount',  'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as',  'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.

  • The Reasoning Behind the Selection - These words are of high frequency, non-unique generality.  They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling.  There are other words that could be included or excluded, as the method of removal isn’t intended to be exact.  However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model.  That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model. 
  • Implications of Selection - The methodology generalizes the unstructured information, so regardless of the nuanced changes of a stop word list; which may or may not include some unique terms, or may or may not meet a particular standard asserted as ideal; the given methodology returns these words to the corpus for the informatic modelling, and the generalized form of significant associations are consistently accounted for, even if some words of significant association were treated as stop words initially.  That is, there isn't a perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.  


Specific Cautionaries

The following cautionaries are more specific to the Stendhal - Collection
  • There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage.  Some low-frequency of numbers meshing with words were removed as well.  All combinations were removed to improve the usability and clarity of the content being modeled informatically.
  • No words were removed, other than what is listed on the Stop Word list.  These words were removed only for the framing and analytic stages.  Words are returned during the network, layering, and detailing stages of modeling. 
  • Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model.  The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal.  Exceptions will be listed in the "specific edits" section.   
  • Split words that are merged back together, if any, will be listed in specific edits.
  • The userability standard is used moderately.  That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process.  The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.  
  • This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style.  Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain largely the same, of a consistent arrangement of major nodes.  In this way, the unstructured informational environment differs from the structured informational environment.  

Specific Edits


0 0 0 0 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 10 10 10 10 10 10 10 10 10 10 10 10 10 10 10 100 100 101 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 102 103 105 105 106 107 107 1079 108 108 109 109 1095 10th 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 11 110 110 1101 111 112 112 113 113 114 114 1144 115 115 116 116 117 117 118 118 119 119 1194 11th 12 12 12 12 12 12 12 12 12 12 12 12 12 12 12 12 12 12 120 120 121 121 1214 122 122 1226 123 123 124 124 125 125 125 1252 126 127 127 128 128 129 129 13 13 13 13 13 13 13 13 13 13 13 13 13 13 13 130 130 131 132 133 1335 134 134 134 1342 135 135 136 137 138 139 139 1396 13th 14 14 14 14 14 14 14 14 14 14 14 14 14 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 140 141 141 142 1423 143 1433 144 146 146 1461 1461 1469 147 148 149 149 15 15 15 15 15 15 15 15 15 15 15 150 150 151 152 1520 153 1533 154 1546 1547 155 155 1550 1552 1552 1562 157 1572 1572 1573 1574 158 158 1583 159 1591 16 16 16 16 16 16 160 1606 161 162 162 1623 163 1632 1639 164 1647 165 1651 1651 1653 166 1663 167 1671 1672 1675 1676 1676 168 1685 169 1694 17 17 17 17 17 17 17 17 170 170 171 171 1710 1711 1713 1715 172 1725 173 1736 1737 1739 1743 1743 1746 1747 1749 175 1754 1754 1754 1755 1759 1759 176 1760 1761 1762 1762 1763 1765 1766 1766 1767 1769 177 177 1771 1771 1772 1774 1774 1775 178 1780 1780 1782 1783 1783 1784 1784 1784 1789 179 1790 1791 1791 1792 1793 1793 1796 1796 1796 1797 1797 1799 17th 17th 18 18 18 18 18 18 18 180 1800 1801 1803 1804 1807 1807 1807 1808 1808 1809 181 1810 1810 1810 1811 1811 1811 1812 1812 1813 1813 1814 1814 1814 1815 1815 1815 1815 1815 1816 1816 1816 1817 1817 1817 1817 1818 1818 1818 1818 1818 1818 1819 1819 1819 1819 1819 1819 1819 1819 1819 1819 1819 1819 1819 1819 182 1820 1820 1820 1820 1820 1820 1820 1820 1820 1820 1820s 1821 1821 1821 1821 1821 1821 1821 1821 1821 1821 1822 1822 1822 1822 1823 1824 1825 1825 1826 183 1830 1830 1836 1838 1839 184 184 1841 1842 1842 185 1854 186 1861 187 1876 188 1888 1889 189 1890 1892 1894 1897 18th 19 19 19 19 19 19 19 19 19 19 190 190 1900 1900 1906 1907 1908 1908 191 192 193 194 195 196 197 198 199 1997 19th 1st 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 20 20 20 20 20 20 20 20 20 0 0 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 10 10 10 10 10 10 10,000 10,000 10,600 102 103 1079 11 11 11 11 11 11 11 1101 112 115 1170 118 119 119 12 12 12 12 12 121 1214 122 1226 123 125 1252 129 13 13 13 131 132 134 134 134 135 137 1396 13th 14 14 14 14 14 141 141 1423 1429 143 1433 146 146 1461 1462 1469 147 149 149 15 15 15 15 15 15 150 150 151 152 1520 153 1530 1533 154 1546 155 155 1552 1559 1562 1573 158 158 1583 1591 16 16 16 16 160 1600 1606 162 162 1632 1639 1647 1651 1653 1653 166 1663 1672 1675 1694 1699 17 17 17 17 17 17 170 170 171 1710 1711 1713 1715 172 173 1739 1740 1743 1743 1746 1747 1749 1754 1754 1755 1759 1759 1759 176 1761 1762 1763 1766 1766 1767 1769 1769 1769 177 177 1770 1771 1772 1774 1775 1780 1783 1784 1784 1787 1788 1789 1789 1792 1793 1793 1796 1796 1799 18 18 180 1801 1803 1806 1807 1808 1808 1808 181 1810 1810 1811 1812 1812 1814 1814 1814 1814 1814 1815 1815 1815 1815 1816 1816 1816 1817 1817 1817 1817 1818 1818 1818 1819 1819 1819 1819 1819 1819 182 1820 1820 1820 1820 1820 1820 1820 1820 1821 1821 1821 1822 1822 1822 1824 1825 1825 1826 1826 1827.2 1828 1829 183 1830 1830 1830 1830 1830 1834 1835 184 184 185 1853 1876 188 1880 1888 1890 1897 19 19 19 19 19 19 19 190 1900 1908 195 196 197 197 19th 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 20 20 20,600 200 200 2002040798 2016 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 202 208 21 21 21 21 21 21 212 213 213 214 215 217 217 218 22 22 22 22 222 222 223 225 227 23 23 230 234 235 236 238 24 24 240 241 245 246 247 25 25 254 254 25th 26 26 26 26 26 264 264 265 266 267 267 268 27 27 274 275 276 278 279 28 28 28 280 281 283 284 285 286 287 289 289 297 299 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 30 30 30 30 30 302 304 307 309 31 31 31 312 314 317 32 32 320 321 323 325 326 329 33 33 33 331 332 332 334 335 336 337 339 339 342 35 35 36 371 385 39 39 39 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 40 406 41 42 424 43 43 43 44 44 446 45 45 46 466 47 47 47 48 48 486 49 490 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 50 50 503 518 52 52 53 53 53720 55 55 56 57 57 57 58 59 59 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 60 62 62 62 63 63 64 66 69 7 7 7 7 7 7 7 7 7 7 7 7 7 70 70 71 71 73 73 77 79 7on 7th 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 81 83 83 84 85 87 88 89 89 8s 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 90 94 95 98 9dit 9g 9lan 9phine 9r 200 200 2002040798 2003 201 2016 2019 202 202 204 205 207 208 209 20th 20th 21 21 21 21 21 21 210 211 212 213 213 214 215 216 217 218 219 22 22 22 22 220 221 222 222 223 225 226 227 228 229 23 23 23 23 23 23 23 23 23 230 231 232 233 234 235 236 238 239 23rd 23rd 23rd 23rd 24 24 24 24 24 24 24 240 241 242 245 246 247 248 249 24th 25 25 25 25 25 25 25 251 252 253 254 255 256 257 258 259 25th 26 26 26 26 26 26 26 261 262 263 264 265 266 267 267 268 269 27 27 27 27 27 27 271 272 273 274 275 275 276 277 278 279 28 28 28 28 28 28 28 280 283 284 285 286 287 289 29 29 29 290 290 292 293 294 297 298 299 29th 2nd 2nd 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 30 30 30 30 30 30 30 300 301 302 303 304 305 305 306 307 308 309 30th 31 31 31 31 31 31 310 312 314 315 316 317 318 32 32 32 32 32 32 32 320 321 323 325 326 327 328 329 33 33 33 330 331 332 332 333 334 335 336 336 337 338 339 339 34 34 34 34 342 342 343 344 346 347 35 35 35 35 356 36 36 36 37 37 37 38 38 38 385 39 39 39 3rd 3rd 3rd 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 40 40 40 406 41 41 42 42 42 424 43 43 43 43 43 43 43 44 44 44 44 446 45 45 45 45 46 46 46 46 466 47 47 47 47 47 48 48 48 48 48 486 49 49 49 491 495 4th 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 5 50 50 50 503 503 51 511 518 52 53 53 53 53 53 537 5372 5372 53720 53720 53720 53720 53720 53720 53720 53720 53720 53720 53720 54 54 54 55 55 55 55 56 56 56 56 57 57 57 57 57 576 576 5763 5763 57638 57638 57638 57638 57638 57638 57638 57638 57638 58 58 58836 59 59 59 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 6 60 60 61 61 61 62 62 62 62 63 63 63 63 64 64 65 65 65 66 66 67 68 68 68 69 69 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 70 70 70 71 71 71 71 71 72 72 73 73 73 74 74 747 75 75 76 76 77 77 78 78 79 79 7th 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 8 80 800 81 81 82 82 83 84 84 843 85 86 86 87 87 88 88 89 89 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 9 90 90 91 91 92 92 92 93 93 94 94 95 96 96 96 96 97 98 98 99 99