P-1-30-20-1

George Byron
Collection
Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design.  Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance.  The following edits were made to the content to improve the framework:
  1. Words were stemmed.
  2. Stop Words were used.
  • The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount',  'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as',  'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.

  • The Reasoning Behind the Selection - These words are of high frequency, non-unique generality.  They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling.  There are other words that could be included or excluded, as the method of removal isn’t intended to be exact.  However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model.  That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model. 
  • Implications of Selection - The methodology generalizes the unstructured information, so regardless of the nuanced changes of a stop word list; which may or may not include some unique terms, or may or may not meet a particular standard asserted as ideal; the given methodology returns these words to the corpus for the informatic modelling, and the generalized form of significant associations are consistently accounted for, even if some words of significant association were treated as stop words initially.  That is, there isn't a perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.  


Specific Cautionaries

The following cautionaries are more specific to the Byron - Collection
  • There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage.  Some low-frequency of numbers meshing with words were removed as well.  All combinations were removed to improve the usability and clarity of the content being modeled informatically.
  • No words were removed, other than what is listed on the Stop Word list.  These words were removed only for the framing and analytic stages.  Words are returned during the network, layering, and detailing stages of modeling. 
  • Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model.  The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal.  Exceptions will be listed in the "specific edits" section.   
  • Split words that are merged back together, if any, will be listed in specific edits.
  • The userability standard is used moderately.  That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process.  The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.  
  • This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style.  Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain largely the same, of a consistent arrangement of major nodes.  In this way, the unstructured informational environment differs from the structured informational environment.  

Specific Edits

0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1.80 10 10 10 100 1000 1001 1002 1003 1005 1006 1007 1008 1009 101 101 1010 1013 1014 1015 1018 1019 102 102 1020 1021 1023 1024 1025 1026 1027 1029 103 103 1030 1031 1032 1033 1034 1035 1036 1037 1038 104 104 1040 1041 1042 1043 1044 1045 1046 1047 1048 1049 105 105 1050 1051 1054 1056 1057 1058 106 1060 1061 1063 1065 1066 1067 1068 107 1070 1072 1074 1075 1076 1078 1079 108 1080 1081 1082 1083 1084 1085 1087 1088 109 109 1090 1091 1092 1094 1095 1097 1099 11 11 11 11 11 11 11 110 110 1100 1101 1104 1106 1108 1109 111 111 1110 1111 1112 1113 1114 1115 1117 1118 1119 112 112 1120 1121 1122 1123 1124 1125 1126 113 113 1131 1132 1133 1134 1135 1136 1137 1139 114 114 114 1140 1142 1143 1145 1148 1149 115 115 1150 1151 1152 1153 1154 1155 1156 1157 1158 1159 116 116 1160 1161 1162 1163 1164 1165 1166 1167 1169 117 1170 1171 1172 1173 1174 1176 1177 1178 1179 118 118 1180 1181 1182 1184 1185 1186 1187 1188 1189 119 1190 1192 1194 1195 1196 12 12 12 120 120 1200 1201 1203 1204 1205 1207 121 1210 1211 1212 1213 1215 1218 1219 122 122 1221 1222 1225 1227 1229 123 123 1230 1231 1232 1237 1238 124 1241 1243 1248 1249 125 125 1250 1251 1252 1253 1254 1255 1256 1257 1259 126 126 1260 1261 1262 1263 1264 1265 1266 1267 1268 1269 127 127 1270 1271 1272 1273 1274 1275 1276 1277 1278 1279 128 128 1280 1281 1282 1283 1284 1287 1288 1289 129 1290 1291 1292 1293 1294 1295 1296 1297 1298 1299 13 13 13 1300 1301 1302 1303 1304 1305 1306 1307 1308 1309 131 1310 1311 1312 1313 1314 1315 1316 1317 1319 132 132 1320 1321 1322 1323 1324 1325 1326 1327 1328 1329 133 133 133 1330 1331 1332 1333 1334 1335 1336 1337 1338 1339 134 134 134 1340 1341 1342 1343 1344 1345 1346 1347 1349 135 135 1350 1351 1352 1353 1354 1355 1356 1357 1358 1359 136 136 1360 1361 137 137 138 138 139 139 139 14 14 14 14 140 140 141 141 142 142 143 144 144 145 145 146 146 147 147 148 148 149 149 15 15 15 15,1807 150 150 151 151 152 152 153 153 154 154 155 155 156 157 157 158 158 159 159 16 16 16 16 16 16 160 160 161 161 162 162 163 163 164 165 165 166 166 167 167 168 168 169 169 17 17 17 17 17 170 171 171 172 173 173 174 175 175 176 177 177 178 178 1788 179 1798 1799 18 18 18 18 180 180 1802 1803 1803 1804 1804 1805 1806 1806 1807 1807 1807 1808 1808 1809 1809 1809 1809 1809 181 1810 1810 1811 1811 1811 1811 1811 1811 1811 1812 1812 1813 1813 1813 1813 1814 1814 1814 1814 1815 1816 1816 1816 1816 1816 1816 1817 1817 1817 1818 1818 1818 1819 1819 182 182 1820 1820 1820 1821 1821 1822 1824 183 1833 1834 184 184 185 185 1852 186 187 187 188 188 1885 189 189 1896 19 19 19 19 19 190 190 1903 191 191 1916 1916 192 192 193 1934 194 194 1942 195 195 196 196 1968 197 197 198 1982 1983 1989 199 199 1994 1997 1st 2 2 2 2 2 2 2 2 2 2 2 2 2 2 20 20 20 20 20 200 200 200 2002 2002 2003 2004 2004 2005 2005 2005 2006 2006 2007 2007 2008 2008 2008 201 201 201 2010 2010 2011 2011 2011 2012 2012 2019 202 203 204 205 205 206 206 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 207 208 209 21 21 21 21 210 211 213 214 215 217 21700 21700 21700 21700 21700 21700 218 219 22 22 22 22 22 22 22 220 221 222 223 225 226 227 228 229 23 23 23 23 230 231 232 233 234 237 239 24 24 24 24 24 240 241 242 243 244 245 247 249 25 25 25 25 250 253 255 256 257 258 26 26 26 260 261 262 263 265 266 267 269 27 27 27 270 271 272 273 276 277 278 279 28 28 28 281 282 283 284 285 287 288 289 29 29 29 29 290 291 293 294 295 297 298 299 2nd 2shall 2tis 3 3 3 3 3 3 3 3 3 30 30 30 30 301 303 305 306 307 308 31 31 31 31 310 311 312 313 314 314 315 316 317 318 319 32 32 32 320 321 322 324 325 326 328 329 33 33 33 330 331 332 333 337 34 340 341 342 343 344 345 346 348 35 35 350 351 352 354 355 356 357 358 359 36 36 36 360 361 364 366 367 368 37 37 37 37 37 370 371 372 373 375 377 378 379 38 38 38 38 380 382 383 384 385 389 39 39 39 39 391 392 394 395 396 397 398 3rd 3rd 4 4 4 4 4 4 4 4 4 4 40 40 40 40 400 402 403 404 405 407 408 41 41 41 41 410 411 412 413 414 415 417 418 419 42 42 42 420 423 424 426 427 428 429 43 43 43 432 433 435 436 438 439 44 44 440 441 442 443 444 445 446 447 449 45 45 450 451 452 453 454 455 456 457 458 459 46 46 461 462 464 465 466 467 468 469 47 47 470 471 472 473 474 475 476 477 48 48 481 482 485 486 487 489 49 49 490 492 494 495 497 498 5 5 5 5 5 50 50 500 501 502 503 504 505 506 507 509 51 51 511 512 513 514 518 519 52 52 521 526 527 528 53 53 530 533 534 536 538 539 54 54 540 540 542 543 545 546 547 549 55 55 550 554 555 556 557 559 56 56 560 561 562 563 564 565 566 567 568 569 57 570 571 573 573 574 575 576 577 578 58 58 58 580 581 585 586 587 589 59 59 592 593 594 598 6 6 6 6 6 6 6 60 60 60 60 60 600 601 602 603 604 605 607 608 609 61 61 610 611 612 613 614 616 617 618 619 62 620 621 622 623 624 625 626 627 628 629 63 63 630 631 632 633 634 635 636 638 639 64 640 641 642 643 644 645 646 647 648 649 65 65 650 651 652 654 655 656 657 658 659 66 66 661 662 663 664 665 666 667 668 669 67 67 670 671 672 673 674 675 676 677 678 679 68 68 680 681 682 684 685 686 689 69 69 690 692 695 696 699 6th 6th 7 7 7 7 7 7 70 70 700 701 702 706 707 709 71 71 710 0 012 1 1 1 1 1,000 10 1002 1019 1020 1023 1025 1025 103 1031 1033 1042 1043 1046 105 1065 1074 1075 1078 1080 109 1094 11 11 11 11 1109 1123 1126 1126 1132 1137 1139 114 1142 115 1150 1154 116 1160 1160 1161 1167 1172 1173 118 1187 1190 1194 12 120 1200 1201 1203 1225 1231 1253 1256 126 1260 1268 1280 1291 1295 13 1302 1319 1321 1329 133 133 1333 1334 1335 1337 1339 134 1340 1342 1343 1344 1345 1346 1347 1358 1359 1360 137 14 14 14 141 142 148 15 15 15,1807 153 154 158 16 16 16 160 162 163 163 165 168 168 169 17 17 17 17 1700 175 1779 178 1788 1798 1799 18 180 1802 1803 1804 1805 1805 1805 1806 1806 1807 1807 1808 1808 1809 1809 1810 1810 1810 1811 1811 1811 1812 1812 1813 1813 1814 1815 1815 1816 1816 1816 1816 1817 1817 1817 1818 1819 182 1820 1820 1820 1822 1823 1824 1830 1833 1834 184 187 1885 1896 19 19 1903 1908 191 1916 1922 1934 1942 1944 1968 1971 1982 1983 1986 1988 1988 1996 1996 1997 19th 1st 2 2 2 2 200 200 2002 2003 2004 2004 2005 2005 2006 2006 2007 2007 2008 201 2010 2010 2011 2011 2012 2012 2019 2019 2019 2019 206 207 21 21 211 213 214 215 21700 21700 21700 22 22 225 226 228 230 231 234 237 24 241 247 249 25 26 26 263 265 265 266 278 281 284 285 287 29 2nd 2shall 2tis 3 3 3 3 3 30 305 31 31 310 312 313 314 319 32 32 320 324 33 330 344 344 345 346 35 35 351 359 36 36 36 360 366 366 37 37 372 389 39 390 391 3rd 4 4 4 4 4 40 405 41 41 414 415 419 42 43 435 438 44 450 458 459 46 464 465 466 467 47 482 489 49 492 495 5 50 511 514 534 536 538 540 540 559 56 561 562 563 566 567 568 568 569 574 575 578 580 586 586 593 594 5th 6 6 6 60 60 60 607 608 609 61 610 611 612 613 614 616 617 619 623 625 626 627 628 629 63 630 631 631 632 633 634 636 638 640 641 642 643 644 652 656 66 665 666 667 668 669 670 673 675 676 677 679 68 682 685 6gle 6th 7 7 7 700 71 711 719 72 730 730 731 736 74 742 748 75 753 754 755 757 758 759 76 761 764 765 766 767 768 769 770 771 772 773 773 774 775 776 778 779 781 784 786 798 8 8 8 801 807 808 809 811 813 816 819 8209 8209 8209 8209 8209 84 855 86 865 880 881 883 885 886 887 889 89 891 894 895 896 897 898 899 9 9 9.5 900 901 902 905 908 909 910 911 912 914 915 917 919 920 921 923 924 925 926 927 929 93 933 933 934 937 939 942 944 945 950 963 97 974 981 981 983 988 990 996 997 998 711 713 714 715 719 72 72 720 721 722 723 726 727 728 73 730 731 732 733 736 737 739 74 74 742 743 746 747 748 75 75 753 754 755 757 758 759 76 76 760 761 762 763 764 765 766 767 768 769 77 77 770 771 772 773 774 775 776 777 778 779 78 78 781 782 784 785 786 787 788 789 79 79 79 791 793 795 797 798 799 8 8 8 8 8 8 8 8 80 80 800 801 802 804 805 806 807 808 809 81 81 81 810 811 812 813 815 816 817 819 82 82 820 8209 8209 8209 821 822 823 824 825 826 827 828 829 83 830 831 832 833 834 835 836 837 838 839 84 84 840 841 842 843 844 845 846 847 848 849 85 850 851 852 853 854 855 856 857 859 86 86 86 860 861 862 864 865 867 869 87 87 873 874 875 877 878 879 88 880 881 882 883 884 885 8859 886 887 888 889 89 89 89 890 891 892 893 894 895 896 897 898 899 9 9 9 9 9 90 900 901 902 905 906 907 908 909 91 910 911 912 913 914 915 916 917 919 92 92 920 921 922 923 924 925 926 927 928 929 93 93 930 931 932 933 934 935 936 937 938 939 94 94 940 941 942 943 944 945 946 95 95 950 951 953 954 955 956 957 958 96 96 962 963 964 967 968 97 97 970 971 972 973 974 975 976 977 978 979 98 98 980 981 983 984 985 986 987 988 99 990 991 993 996 997 998 999