Stendhal
Collection
Cautionaries are edits made to the original content to improve the usability and clarity of the informatic design. Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance. The following edits were made to the content to improve the framework:
- Words were stemmed.
- Stop words were removed.
- The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also', 'although', 'always', 'am', 'among', 'amongst', 'amoungst', 'amount', 'an', 'and', 'another', 'any', 'anyhow', 'anyone', 'anything', 'anyway', 'anywhere', 'are', 'around', 'as', 'at', 'back', 'be', 'became', 'because', 'become', 'becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom', 'but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven', 'else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own', 'part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.
- The Reasoning Behind the Selection - These words are high-frequency, general, non-unique terms. They are removed during the analytic stage of modeling so that the more distinctive terminology of the content stands out. Other words could be included or excluded, as the method of removal isn't intended to be exact. However, the removed terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model. After the analytic stage, these terms are returned to the informatic model for developing its networks, layering, directionality, and detailing.
- Implications of Selection - The methodology generalizes the unstructured information. A stop word list may vary in its details: it may or may not include some unique terms, and it may or may not meet a particular standard asserted as ideal. Regardless, the methodology returns these words to the corpus for the informatic modeling, so the generalized form of significant associations is consistently accounted for, even if some words of significant association were initially treated as stop words. That is, there is no perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.
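The stop word handling described in the list above can be illustrated with a minimal sketch. It is not the code used to build the model: the function name `split_for_analysis`, the abbreviated `STOP_WORDS` set, and the sample tokens are all assumptions made for illustration. The sketch keeps two views of the same tokens, one with stop words removed for the framing and analytic stages, and one complete view so the words can be returned later.

```python
# Abbreviated, hypothetical subset of the Stop Word List shown above.
STOP_WORDS = {"a", "about", "and", "in", "of", "the", "to", "was"}


def split_for_analysis(tokens, stop_words=STOP_WORDS):
    """Return (analytic_tokens, full_tokens).

    analytic_tokens omits stop words (framing and analytic stages);
    full_tokens keeps every token so stop words can be returned for
    the network, layering, and detailing stages.
    """
    full_tokens = [t.lower() for t in tokens]
    analytic_tokens = [t for t in full_tokens if t not in stop_words]
    return analytic_tokens, full_tokens


tokens = "The Red and the Black".split()
analytic, full = split_for_analysis(tokens)
print(analytic)  # stop words removed
print(full)      # every token retained
```

Because the complete token list is kept alongside the filtered one, a different stop word list changes only the analytic view, which is one way to read the claim that list variations do not alter what is returned to the model.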
Specific Cautionaries
The following cautionaries are more specific to the Stendhal - Collection:
- There was a large variety of numbers and number-letter combinations that marked news sections. All numbers and letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage. A small number of low-frequency tokens in which numbers were meshed with words were removed as well. All such combinations were removed to improve the usability and clarity of the content being modeled informatically.
- No words were removed other than those on the Stop Word List. These words were removed only for the framing and analytic stages. Words are returned during the network, layering, and detailing stages of modeling.
- Errors in the content, such as word conversion errors, are not edited and will remain transparent to viewers of the model. The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal. Exceptions will be listed in the "specific edits" section.
- Split words that are merged back together, if any, will be listed in specific edits.
- The usability standard is applied moderately. That is, terms like "ebook", proper nouns such as publisher names, and any other terms reflective of the overall publication will likely be included in the modeling process. The models are designed to account for terms that work in different contexts, such as publication terms, which will be presented alongside the design of the actual written work, with the ideas of the given author intact.
- This methodology is designed to manage the unstructured informational environment, which has a sound and consistent overall design even though it manifests from categorical arrangements that are inconsistent and imperfect, much like a hairstyle. Even though individual terms (the individual hairs) will change, the overall styling (the informatic model) will remain largely the same, with a consistent arrangement of major nodes. In this way, the unstructured informational environment differs from the structured informational environment.
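The removal of numbers and number-letter combinations described in the first cautionary of this list can be sketched as follows. This is a hedged illustration rather than the procedure actually used: it assumes that any token containing a digit is removed, and the name `partition_numeric` and the sample tokens are hypothetical. The real rule keeps combinations that constitute words or abbreviations, which a simple digit test does not capture.

```python
import re

# Assumption: a token is treated as removable if it contains any digit.
HAS_DIGIT = re.compile(r"\d")


def partition_numeric(tokens):
    """Split tokens into (kept, removed).

    kept preserves the original order of digit-free tokens;
    removed is sorted so that it can be disclosed as a list.
    """
    kept = [t for t in tokens if not HAS_DIGIT.search(t)]
    removed = sorted(t for t in tokens if HAS_DIGIT.search(t))
    return kept, removed


sample = ["chapter", "1830", "19th", "ebook", "7on", "10,000"]
kept, removed = partition_numeric(sample)
print(kept)     # digit-free tokens, original order
print(removed)  # removed tokens, sorted as strings
```

Returning the removed tokens as their own sorted list keeps the removal fully disclosed, in the same spirit as the disclosed Stop Word List.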
0
0
0
0
0
0
0
0
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
10
10
10
10
10
10
10
10
10
10
10
10
10
10
10
100
100
101
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
102
103
105
105
106
107
107
1079
108
108
109
109
1095
10th
11
11
11
11
11
11
11
11
11
11
11
11
11
11
11
11
11
11
11
110
110
1101
111
112
112
113
113
114
114
1144
115
115
116
116
117
117
118
118
119
119
1194
11th
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
120
120
121
121
1214
122
122
1226
123
123
124
124
125
125
125
1252
126
127
127
128
128
129
129
13
13
13
13
13
13
13
13
13
13
13
13
13
13
13
130
130
131
132
133
1335
134
134
134
1342
135
135
136
137
138
139
139
1396
13th
14
14
14
14
14
14
14
14
14
14
14
14
14
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
140
141
141
142
1423
143
1433
144
146
146
1461
1461
1469
147
148
149
149
15
15
15
15
15
15
15
15
15
15
15
150
150
151
152
1520
153
1533
154
1546
1547
155
155
1550
1552
1552
1562
157
1572
1572
1573
1574
158
158
1583
159
1591
16
16
16
16
16
16
160
1606
161
162
162
1623
163
1632
1639
164
1647
165
1651
1651
1653
166
1663
167
1671
1672
1675
1676
1676
168
1685
169
1694
17
17
17
17
17
17
17
17
170
170
171
171
1710
1711
1713
1715
172
1725
173
1736
1737
1739
1743
1743
1746
1747
1749
175
1754
1754
1754
1755
1759
1759
176
1760
1761
1762
1762
1763
1765
1766
1766
1767
1769
177
177
1771
1771
1772
1774
1774
1775
178
1780
1780
1782
1783
1783
1784
1784
1784
1789
179
1790
1791
1791
1792
1793
1793
1796
1796
1796
1797
1797
1799
17th
17th
18
18
18
18
18
18
18
180
1800
1801
1803
1804
1807
1807
1807
1808
1808
1809
181
1810
1810
1810
1811
1811
1811
1812
1812
1813
1813
1814
1814
1814
1815
1815
1815
1815
1815
1816
1816
1816
1817
1817
1817
1817
1818
1818
1818
1818
1818
1818
1819
1819
1819
1819
1819
1819
1819
1819
1819
1819
1819
1819
1819
1819
182
1820
1820
1820
1820
1820
1820
1820
1820
1820
1820
1820s
1821
1821
1821
1821
1821
1821
1821
1821
1821
1821
1822
1822
1822
1822
1823
1824
1825
1825
1826
183
1830
1830
1836
1838
1839
184
184
1841
1842
1842
185
1854
186
1861
187
1876
188
1888
1889
189
1890
1892
1894
1897
18th
19
19
19
19
19
19
19
19
19
19
190
190
1900
1900
1906
1907
1908
1908
191
192
193
194
195
196
197
198
199
1997
19th
1st
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
20
20
20
20
20
20
20
20
20
0
0
0
0
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
10
10
10
10
10
10
10,000
10,000
10,600
102
103
1079
11
11
11
11
11
11
11
1101
112
115
1170
118
119
119
12
12
12
12
12
121
1214
122
1226
123
125
1252
129
13
13
13
131
132
134
134
134
135
137
1396
13th
14
14
14
14
14
141
141
1423
1429
143
1433
146
146
1461
1462
1469
147
149
149
15
15
15
15
15
15
150
150
151
152
1520
153
1530
1533
154
1546
155
155
1552
1559
1562
1573
158
158
1583
1591
16
16
16
16
160
1600
1606
162
162
1632
1639
1647
1651
1653
1653
166
1663
1672
1675
1694
1699
17
17
17
17
17
17
170
170
171
1710
1711
1713
1715
172
173
1739
1740
1743
1743
1746
1747
1749
1754
1754
1755
1759
1759
1759
176
1761
1762
1763
1766
1766
1767
1769
1769
1769
177
177
1770
1771
1772
1774
1775
1780
1783
1784
1784
1787
1788
1789
1789
1792
1793
1793
1796
1796
1799
18
18
180
1801
1803
1806
1807
1808
1808
1808
181
1810
1810
1811
1812
1812
1814
1814
1814
1814
1814
1815
1815
1815
1815
1816
1816
1816
1817
1817
1817
1817
1818
1818
1818
1819
1819
1819
1819
1819
1819
182
1820
1820
1820
1820
1820
1820
1820
1820
1821
1821
1821
1822
1822
1822
1824
1825
1825
1826
1826
1827.2
1828
1829
183
1830
1830
1830
1830
1830
1834
1835
184
184
185
1853
1876
188
1880
1888
1890
1897
19
19
19
19
19
19
19
190
1900
1908
195
196
197
197
19th
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
20
20
20,600
200
200
2002040798
2016
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
2019
202
208
21
21
21
21
21
21
212
213
213
214
215
217
217
218
22
22
22
22
222
222
223
225
227
23
23
230
234
235
236
238
24
24
240
241
245
246
247
25
25
254
254
25th
26
26
26
26
26
264
264
265
266
267
267
268
27
27
274
275
276
278
279
28
28
28
280
281
283
284
285
286
287
289
289
297
299
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
30
30
30
30
30
302
304
307
309
31
31
31
312
314
317
32
32
320
321
323
325
326
329
33
33
33
331
332
332
334
335
336
337
339
339
342
35
35
36
371
385
39
39
39
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
40
406
41
42
424
43
43
43
44
44
446
45
45
46
466
47
47
47
48
48
486
49
490
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
50
50
503
518
52
52
53
53
53720
55
55
56
57
57
57
58
59
59
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
60
62
62
62
63
63
64
66
69
7
7
7
7
7
7
7
7
7
7
7
7
7
70
70
71
71
73
73
77
79
7on
7th
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
81
83
83
84
85
87
88
89
89
8s
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
90
94
95
98
9dit
9g
9lan
9phine
9r
200
200
2002040798
2003
201
2016
2019
202
202
204
205
207
208
209
20th
20th
21
21
21
21
21
21
210
211
212
213
213
214
215
216
217
218
219
22
22
22
22
220
221
222
222
223
225
226
227
228
229
23
23
23
23
23
23
23
23
23
230
231
232
233
234
235
236
238
239
23rd
23rd
23rd
23rd
24
24
24
24
24
24
24
240
241
242
245
246
247
248
249
24th
25
25
25
25
25
25
25
251
252
253
254
255
256
257
258
259
25th
26
26
26
26
26
26
26
261
262
263
264
265
266
267
267
268
269
27
27
27
27
27
27
271
272
273
274
275
275
276
277
278
279
28
28
28
28
28
28
28
280
283
284
285
286
287
289
29
29
29
290
290
292
293
294
297
298
299
29th
2nd
2nd
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
30
30
30
30
30
30
30
300
301
302
303
304
305
305
306
307
308
309
30th
31
31
31
31
31
31
310
312
314
315
316
317
318
32
32
32
32
32
32
32
320
321
323
325
326
327
328
329
33
33
33
330
331
332
332
333
334
335
336
336
337
338
339
339
34
34
34
34
342
342
343
344
346
347
35
35
35
35
356
36
36
36
37
37
37
38
38
38
385
39
39
39
3rd
3rd
3rd
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
40
40
40
406
41
41
42
42
42
424
43
43
43
43
43
43
43
44
44
44
44
446
45
45
45
45
46
46
46
46
466
47
47
47
47
47
48
48
48
48
48
486
49
49
49
491
495
4th
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
50
50
50
503
503
51
511
518
52
53
53
53
53
53
537
5372
5372
53720
53720
53720
53720
53720
53720
53720
53720
53720
53720
53720
54
54
54
55
55
55
55
56
56
56
56
57
57
57
57
57
576
576
5763
5763
57638
57638
57638
57638
57638
57638
57638
57638
57638
58
58
58836
59
59
59
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
60
60
61
61
61
62
62
62
62
63
63
63
63
64
64
65
65
65
66
66
67
68
68
68
69
69
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
70
70
70
71
71
71
71
71
72
72
73
73
73
74
74
747
75
75
76
76
77
77
78
78
79
79
7th
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
80
800
81
81
82
82
83
84
84
843
85
86
86
87
87
88
88
89
89
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
90
90
91
91
92
92
92
93
93
94
94
95
96
96
96
96
97
98
98
99
99