P-10-14-19-1

Michel de Montaigne
Collection
Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design.  Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance.  The following edits were made to the content to improve the framework:
  1. Words were stemmed.
  2. Stop Words were used.
  • The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount',  'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as',  'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.

  • The Reasoning Behind the Selection - These words are of high frequency, non-unique generality.  They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling.  There are other words that could be included or excluded, as the method of removal isn’t intended to be exact.  However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model.  That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model. 
  • Implications of Selection - The methodology generalizes the unstructured information, so regardless of the nuanced changes of a stop word list; which may or may not include some unique terms, or may or may not meet a particular standard asserted as ideal; the given methodology returns these words to the corpus for the informatic modelling, and the generalized form of significant associations are consistently accounted for, even if some words of significant association were treated as stop words initially.  That is, there isn't a perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.  
Specific Cautionaries

The following cautionaries are more specific to the Montaigne - Collection
  • There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage.  Some low-frequency of numbers meshing with words were removed as well.  All combinations were removed to improve the usability and clarity of the content being modeled informatically.
  • No words were removed, other than what is listed on the Stop Word list.  These words were removed only for the framing and analytic stages.  Words are returned during the network, layering, and detailing stages of modeling. 
  • Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model.  The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal.  Exceptions will be listed in the "specific edits" section.   
  • Split words that are merged back together, if any, will be listed in specific edits.
  • The userability standard is used moderately.  That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process.  The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.  
  • This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style.  Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain largely the same, of a consistent arrangement of major nodes.  In this way, the unstructured informational environment differs from the structured informational environment.  
Specific Edits

1 1 1 1 1 1 1 1 1 1 1 1 1 1 10 10 10 10 10 10 101 102 1027 103 103 103 103 103 1037 1039 1041 106 106 1062 107 107 1070 108 108 109 1093 1095 10th 11 11 11 1103 1123 1130 1131 114 114 115 115 1151 116 1165 117 118 119 12 12 12 12 12 12 12 12 121 123 124 125 126 127 127 13 13 13 13 13 13 13 13 13 13 13 13 13 1305 135 137 137 137 14 14 14 14 140 141 144 145 147 1498 15 15 15 15 15 15 15 15 15 15 15 150 151 151 1513 153 1530 1533 1536 1536 154 154 155 156 1562 1562 1563 1566 1568 1570 1570 1571 1571 1572 1572 1574 1577 158 1580 1580 1580 1581 1582 1584 1585 1585 1588 1588 1590 1590 16 16 16 16 160 1609 1613 1613 1613 162 1626 164 1689 17 17 17 17 17 17 17 17 17 17 17 17 17 1774 18 18 18 18 18 18 18 181 181 183 183 184 1853 1854 1858 187 1877 1877 188 189 18th 19 19 19 19 19 19 19 19 1900 195 196 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 20 20 20 20 20 20 201 208 21 21 21 21 21 21 21 21 21 22 22 22 22 22 22 22 22 225 22d 23 23 23 23 23 23 23 23 23 238 24 24 24 244 247 249 24th 25 25 25 25 25 25 25 25 253 26 26 26 26 26 26 267 27 27 27 27 27 272 275 28 28 28 29 29 29 29 29 29 29 291 294 2d 3 3 3 3 3 3 3 3 3 3 3 3 3 3 3 30 30 31 317 32 32 32 320 323 33 33 33 34 34 34 34 340 346 35 35 35 35 35 358 36 36 36 36 36 362 37 37 37 372 38 38 38 381 382 383 384 387 387 39 39 39 39 395 397 4 4 4 4 4 4 4 4 4 4 4 40 40 40 404 41 414 42 425 43 43 434 44 44 44 44 44 444 45 45 45 45 45 452 459 462 467 47 47 47 47 474 475 476 48 48 48 482 486 49 49 490 493 499 5 5 5 5 5 5 5 5 5 5 5 503 51 51 51 511 528 529 53 53 53 54 54 54 54 56 56 56 56 57 57 57 576 58 580 59 59 599 6 6 6 6 6 6 6 6 6 6 6 611 615 62 63 63 636 64 64 643 647 65 65 65 65 653 657 658 67 67 670 674 68 68 684 69 69 69 694 1 10 10 10 1027 103 1041 106 108 109 1093 10th 10th 11 1157 12 12 124 12th 13 13 13 13 13 13 13 14 14,000 1402 1498 1498 15 15 1536 1536 1537 154 1540 1544 1563 1582 1588 1588 1588 1590 1595 16 1613 1687 17 17 17 17 17 18 18 18 180 1850 1854 1877 18th 19 19 19 1st 2 2 2 2 2 2 2 20 20 20 20 203 206 21 21 21 21 22 22 22 225 23 23 238 24 24 24th 25 26 26 26 26 267 28 28 283 29 29 2d 3 3 30th 31 32 33 33 34 35 35 35 35 358 36 37 37 382 387 39 4 4 4 4 40 400 414 42 425 43 44 444 45 474 48 4th 5 500 503 51 51 528 529 53 54 57 58 6 6 6 61 64 65 65 653 658 670 674 68 684 69 694 7 7 7 7 7 7 7 73 73 732 74 743 75 752 77 79 7th 8 8 80 80 83 849 849 874 874 89 8th 9 9 9 9 9 90 90 90 914 92 942 98 98 99 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7 7,6 70 70 704 705 71 72 72 73 73 73 73 73 734 74 74 74 743 748 75 75 75 752 76 76 764 77 77 77 770 774 78 782 79 798 7th 8 8 8 8 8 8 8 8 8 8 8 80 81 81 81 82 82 83 84 849 85 86 87 874 874 88 88 89 898 8vo 8vo 9 9 9 9 9 9 9 9 9 9 9 9 9 9 90 90 91 91 911 913 914 92 92 93 93 93 93 936 94 94 942 95 95 951 957 96 97 98 98 98 981 985 99