Voltaire
Collection
Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design. Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance. The following edits were made to the content to improve the framework:
- Words were stemmed.
- Stop Words were used.
- The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount', 'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as', 'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.
- The Reasoning Behind the Selection - These words are of high frequency, non-unique generality. They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling. There are other words that could be included or excluded, as the method of removal isn’t intended to be exact. However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model. That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model.
- Implications of Selection - The methodology generalizes the unstructured information, so regardless of the nuanced changes of a stop word list; which may or may not include some unique terms, or may or may not meet a particular standard asserted as ideal; the given methodology returns these words to the corpus for the informatic modelling, and the generalized form of significant associations are consistently accounted for, even if some words of significant association were treated as stop words initially. That is, there isn't a perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.
Specific Cautionaries
The following cautionaries are more specific to the Voltaire - Collection:
- There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage. Some low-frequency of numbers meshing with words were removed as well. All combinations were removed to improve the usability and clarity of the content being modeled informatically.
- No words were removed, other than what is listed on the Stop Word list. These words were removed only for the framing and analytic stages. Words are returned during the network, layering, and detailing stages of modeling.
- Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model. The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal. Exceptions will be listed in the "specific edits" section.
- Split words that are merged back together, if any, will be listed in specific edits.
- The userability standard is used moderately. That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process. The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.
- This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style. Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain largely the same, of a consistent arrangement of major nodes. In this way, the unstructured informational environment differs from the structured informational environment.
_r
00
01
02
05161
06
06
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
10
10
10
10
10
10
10
10
10
10
10
10
10
10
10
100
100
100
100
101
101
103
103
104
105
105
105
105
105
106
106
107
107
107
108
108
109
109
109
109
11
11
11
11
11
11
11
11
11
11
110
111
111
111
112
112
113
114
115
1152
116
117
118
119
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
12
120
121
122
122
123
124
126
126
128
129
13
13
13
13
13
13
13
13
13
13
13
13
13
130
131
131
132
133
133
134
134
135
136
136
1372
138
1383
139
14
14
14
14
14
14
14
14
14
14
14
14
14
14
14
14
14
141
142
143
144
144
1440
145
145
1455
147
147
148
149
149
15
15
15
15
15
15
15
15
15
15
15
15
15
15
150
150
150
150
150
151
151
152
153
153
1531
154
154
155
155
156
156
157
157
157
158
158
158
159
159
16
16
16
16
16
16
16
16
16
16
16
16
16
16
160
161
1617
162
163
163
163
163
163
164
164
164
164
1646
165
165
165
1650
166
166
166
1661
1667
167
167
167
16731743
1676
168
168
1684
1688
169
169
1695
1698
1699
17
17
17
17
17
17
17
17
17
17
17
17
170
170
1701
1702
1703
1709
1709
171
171
171
171
1711
1715
1715
1716
171785
1718
1718
171973
172
172
1720
1721
1721
1722
1722
1723
1726
1726
1729
173
173
1730
1738
1738
1739
1739
174
174
1741
1747
1749
175
175
1750
1750
1750
1750
1750
1750
1750
1751
1751
1752
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1757
1757
1758
1758
1759
1759
1759
1759
176
176
176
176
1761
1762
1764
1766
1767
177
177
1770
1773
1775
1777
1779
178
1783
1783
1787
179
179
179
179
18
18
18
18
18
18
18
18
18
18
180
180
180
1800
181
181
181
1813
182
182
182
183
183
1832
184
184
1841
1844
1846
1847
185
185
186
187
187
1887
189
18972
18972
18972
18972
1898
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
19
190
1904
1908
191
191
1913
1914
1915
1916
1919
192
192
1925
1926
1927
1929
193
193
1933
1933
1934
1936
1937
1938
1939
1939
194
1940
1942
1942
1946
1946
1947
195
195
1953
1954
1955
1955
1955
1955
1956
196
197
198
198
1998
1s
1x
2
2
2
2
2
2
2
2
2
01
010555mbp
05161
1
1
1
1
1
1
10
10
10
10
10
10
10
10
10
10
10
10
10
10,000,000
10,800
100
100,000
101
102
103
104th
105
106
107
107
109
109
109
11
11
11
11
11
11
11
11
11
11
11
110
111
113
1152
117
119
12
12
12
12
12
12
12
120
121
123,249,600
124
125
129
13
13
13
13
13
13
13
131
133
134
135
136
136
137
137
139
14
14
14
14
14
14
14
14
141
143
145
145
1455
149
15
15
15
15
15
15
150
150
150
150
150
153
157
157
157
159
16
16
16
16
16
16
16
16
161
163
164
164
166
167
1692
1698
17
17
17
17
17
17
17
17
171
171
1715
1721
1721
1722
1723
1738
1738
1750
1750
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1755
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1756
1757
1758
1758
1759
1759
1759
1759
1759
176
176
176
1766
1770
1770
1777
1783
1785
1787
18
18
18
18
18
18
18
18
18
18
18
18
18
180
181
1813
183
1833
185
186
187
188
189
189
189
18972
18972
18972
1898
19
19
19
19
19
19
19
19
1908
191
191
192
1929
193
1933
1934
1939
1946
1954
1955
1955
1955
1956
198
1998
2
2
2
2
2
2
2,2
20
20
20
20
20
20
20
20
20
20
20
20,000
20,000
200
2006
201
201
2019
2019
202
203
204
204
205
207
207
208
208
21
21
21
21
21
21
210
210
211
212
213
213
213
215
216
217
217
219
22
22
22
22
22
221
222
223
224
225
225
22511
226
227
229
23
23
23
23
23
23
231
233
233
235
237
238
239
239
239
239
239
24
24
24
24
24
240
240
244
244
244
245
247
247
247
248
249
249
249
25
25
25
25
25
25
250
250
250
250
251
253
257
26
26
26
26
26
26
267
27
27
27
27
27
27
27
27
28
28
28
28
28
28
28
28
28
29
29
29
29
29
29
29
3
3
3
3
3
3
3
3
3
3
3
30
30
30
30
30
30
30
300
306
31
31
31
31
31
31
31
31
31
32
32
32
32
32
32
32
322
322
33
33
33
33
34
34
34
34
35
350
36
36
36
362
37
37
37
38
38
39
39
39
39
4
4
4
4
4
4
4
4
4
4
4
4
4
4,000
40
40
40
40
40
40
41
41
41
41
42
42
42
42
42
43
43
43
43
44
44
44
44
44
44
45
45
45
46
46
47
47
47
47
47
48
48
48
48
48
489
49
49
49
5
5
5
5
5
5
5
5
50
50
50
51
51
52
52
53
53
53
54
54
54
55
55
55
55
55
56
56
57
57
57
57
57
57
58
58
581
59
59
59
59
6
6
6
6
6
6
6
6
6
6
6
6.9
60
60
61
62
62
62
6239
63
63
63
639
64
64
641
65
65
65
65
66
66
67
67
68
68
68
68
69
69
69
69
6nio
6nio
7
7
7
7
7
7
7
7
7
7
7
7
70
70
70
71
71
71
72
72
73
73
730
74
74
74
75
76
76
77
77
77
78
78
78
78
78
79
79
79
8
8
8
8
8
8
8
8
8
8
8
8
80
80
81
81
81
82
82
83
83
83
84
84
84
85
85
85
86
86
86
87
87
87
87
87
88
88
89
89
89
89
9
9
9,000
90
90
90
91
91
92
92
92
92
93
93
93
93
93
936
936
94
94
94
94
95
95
95
95
96
96
97
97
98
98
98
99
99
99
99
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2,2
2,6
20
20
20
20
20
20
20
20
20
20
20
20
200
200
200
200
2002
2006
201
201
201
201
2019
202
202
202
202
203
203
204
205
207
208
209
21
21
21
21
21
21
21
21
21
21
210
211
212
212
213
213
213
213
213
214
214
215
216
216
216
216
217
217
218
219
219
219
219
22
22
22
22
22
22
22
22
22
22
220
220
220
220
221
221
221
221
222
222
222
223
223
224
224
225
225
225
225
225
22511
226
226
227
228
229
229
229
23
23
23
23
23
23
23
23
23
23
230
230
230
231
231
232
233
233
233
233
233
234
235
235
236
237
237
238
238
238
239
239
239
24
24
24
24
24
24
24
241
243
243
244
244
244
245
245
247
248
248
249
25
25
25
25
25
25
25
25
25
25
25
250
250
251
253
254
26
26
26
26
26
26
26
26
26
267
26th
27
27
27
27
27
27
27
27
27
28
28
28
28
28
28
28
28
28
28
28
28
29
29
29
29
29
29
29
29
29
29
2o
2o4
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3,761
30
30
30
30
30
30
30
300
31
31
31
31
31
31
31
31
31
31
31
31
31
31
31
314
32
32
32
32
32
32
32
32
32
33
33
33
33
33
33
33
335
337
34
34
34
34
34
34
34
347
35
35
35
35
35
36
36
36
36
36
36
36
362
37
37
37
37
37
37
38
38
38
38
38
38
38
39
39
39
39
39
39
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4
40
40
40
40
40
40
40
4000
41
41
41
41
41
41
41
42
42
42
42
42
43
43
43
43
44
44
44
44
44
44
44
45
45
45
45
46
46
46
46
46
468
47
47
47
47
47
47
48
48
48
48
48
486
489
49
49
49
49
49
4ndega
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
50
50
50
51
51
51
51
52
52
52
52
53
53
53
53
53
54
54
54
54
54
54
55
55
55
55
55
55
56
56
56
56
56
57
57
57
57
57
57
57
57
57
57
57
57
58
58
58
58
58
58
58
59
59
59
59
59
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6
6.9
60
60
60
61
61
61
61
61
61
61
61
62
62
62
62
62
6239
63
63
63
63
63
64
64
64
65
65
65
65
65
65
66
66
66
66
67
67
67
67
67
68
68
68
68
68
68
69
69
69
69
69
69
69
6gorgent
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7
7,000
70
70
70
70
71
71
71
71
71
71
71
72
72
72
72
73
73
73
73
73
730
74
74
74
74
74
75
75
75
75
76
76
76
76
77
77
77
77
77
78
78
78
78
78
78
79
79
79
79
79
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
80
80
80
80
81
81
81
81
81
82
82
82
82
82
82
82
83
83
83
83
83
83
84
84
84
84
84
85
85
85
85
85
86
86
86
86
87
87
87
87
87
87
88
88
88
88
88
8859
89
89
89
89
89
9
9
9
9
9
9
9
9
9
9
9
9
9
9
9
90
90
90
90
90
90
90
91
91
91
91
91
92
92
92
93
93
93
93
93
94
94
94
94
95
95
95
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
96
97
97
97
97
98
98
98
98
99
99
99
9s3