Anthony Trollope
Collection
Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design. Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance. The following edits were made to the content to improve the framework:
- Words were stemmed.
- Stop Words were used.
- The Stop Word List: 'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount', 'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as', 'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.
- The Reasoning Behind the Selection - These words are of high frequency, non-unique generality. They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling. There are other words that could be included or excluded, as the method of removal isn’t intended to be exact. However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model. That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model.
- Implications of Selection - The methodology generalizes the unstructured information, so regardless of the nuanced changes of a stop word list; which may or may not include some unique terms, or may or may not meet a particular standard asserted as ideal; the given methodology returns these words to the corpus for the informatic modelling, and the generalized form of significant associations are consistently accounted for, even if some words of significant association were treated as stop words initially. That is, there isn't a perfect stop word list, and lists will vary, but the informatic methodology manages these variations for a consistent outcome, so long as most non-unique terminology is removed.
Specific Cautionaries
The following cautionaries are more specific to the Trollope - Collection:
- There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage. Some low-frequency of numbers meshing with words were removed as well. All combinations were removed to improve the usability and clarity of the content being modeled informatically.
- No words were removed, other than what is listed on the Stop Word list. These words were removed only for the framing and analytic stages. Words are returned during the network, layering, and detailing stages of modeling.
- Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model. The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal. Exceptions will be listed in the "specific edits" section.
- Split words that are merged back together, if any, will be listed in specific edits.
- The userability standard is used moderately. That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process. The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.
- This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style. Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain largely the same, of a consistent arrangement of major nodes. In this way, the unstructured informational environment differs from the structured informational environment.
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
00
01
01
011
090
0â
0â
0e
0f
0f
0f
0f
0f
0f
0f
0f
0f
0h
0h
0h
0h
0h
0h
0h
0h
0h
0h
0h
0h
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0n
0r
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1,000
1,009
1,200
1,500
1,668
1,990
1.97
10
10
10
10
10
10,000
10_
10_s
10_s_
100
100
100
100
1000
101
101
102
103
104
105
105
105
106
108
108
109
10s
10s
11
11
11
11
11
11
11
110
1100101
1101
110s
110s
111
111
111
1110
111988
112
112
112
112
113
113
114
115
115
116
116
117
117
118
118
119
119
11â
12
12
12
12
12
120
120
121
122
122
122
123
124
124
125
125
1250
126
127
127
128
128
129
12th
13
13
13
13
13
13
13
130
130
130
131
132
132
133
133
133
134
134
135
135
136
136
137
137
139
14
14
14
141
142
143
145
145
145
146
146
147
147
148
149
149
14th
15
15
150
150
150
1500
151
152
152
153
154
154
156
157
158
158
159
15s
16
16_s
16_s
160
160
160
161
162
162
163
163
164
164
165
166
167
168
168
169
16th
17
17
17
17
17_s
170
170
171
172
172
173
173â
174
175
176
176
177
177
178
178
179
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
18
180
180
180
18000
181
182
183
183
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
184
185
185
185
185
185
185â
185â
185â
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
186
1862
1864
187
187
187
187
187
187
1871
1873
1874
1875
188
189
1893
19
19
19
19
19
19,999
19_s
19_s
190
191
193
194
195
19500
196
196
198
19th
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1atterly
1e.â
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1s
1t
1v
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
00
000301
011
015th
0â
0â
0f
0f
0f
0f
0f
0h
0h
0h
0h
0h
0h
0h
0h
0h
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1
1,200
10
10
10
104
105
105
108
109
11
11
11
11.30
1100101
1101
111
111
1110
112
112
116
118
119
12
12
12
12
120
122
124
128
12s
13
13
13
130
132
133
133
134
136
137
139
14
14
142
143
146
15
15
150
154
157
158
159
15th
160
160
160
162
163
164
166
1674
168
16th
17
17
170
172
1721
176
178
18
18
18
18
18
18
18
18
18
180
180
18000
18000
18000
182
1820
184
185
185
186
186
186
186
186
187
187
187
1872
1876
188
1893
19
190
193
19500
196
198
19th
19th
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1â
1evertheless
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1n
1oz
1s
1s
1s
1s
1s
1s
1v
1y
2
2
2
2
2
2
2,900
20
20
20
2000
2002
2003
2012
202
2020
2020
2020
2020
203
204
205
206
208
20th
212
2158
217
218
219
21o
220
223
226
23
23000
232
233
234
236
238
239
23rd
23rd
24
240
242
244
246
248
24th
24th
24th
24th
25
250
252
253
254
256
256
258
25th
260
262
263
264
265
266
271
272
273
274
275
275
275
276
279
28
281
282
286
2860
288
28th
29
29
291
2913
292
293
296
298
2a
2â
2â
2â
2â
2b2
2e
2e2
2f
2o
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3.45
30
30,000
300
300
300,000
3000
302
304
3045
306
307
308
309
30th
312
315
316
3166
318
32
32
32
321
322
323
325
326
328
33
33
33
330
332
333
334
335
337
338
34
34
340
344
348
35
350
352
353
354
356
358
359
36
360
362
364
365
372
374
376
378
38
380
383
386
388
392
394
398
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3ea
3o
3rd
3rd
3rd
4
4
4
4
4.8
40
400
402
406
407
408
41
41
410
412
414
416
418
422
424
426
427
427
43
430
432
433
434
436
437
438
439
45,000
455
4599
4599
4599
46
48
49
4h
4th
5
5
5
5
5
5
5
5
5.58
50
50
50
50
500
500
500
5000
51
52
52
5231
5231
5231
5231
56
57
58
6
6
6
6
6
6.22
60
619
62
64
65
650
66
66
6d
7
7
7
7
7.30
7.45
72
73
7381
74
75
750
750
76
78
7â
7o
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8.22
8.45
80
80
8000
80a
82
824
84
844
86
87
89
8th
8th
9
9
90
90
92
94
95
96
97
98
9d
9th
2
2
2
2
2
2
2
2
2
2
2
2
2
2
2,000
2,999
2_d
20
20
20
20,000
200
2000
2002
2003
2012
2020
203
204
205
206
207
208
209
20th
21
21
21
21
2'11
212
2158
217
217
218
219
21o
22
22_s
220
221
223
226
227
227
229
23
23000
23000
231
232
233
234
235
236
237
238
239
23rd
23rd
24
24
240
241
242
243
244
245
246
247
247
249
24th
24th
25
25
25
25
25
25,996
250
250
251
252
253
254
255
256
256
257
25th
26
26
26,666
260
261
262
263
264
265
266
267
268
27
27
271
272
274
275
275
275
276
277
279
28
28
28
28
281
282
283
286
2860
288
28th
28th
28th
29
29
29
29
29
29
29,999
291
2913
292
293
296
298
299
2a
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2â
2b2
2e2
2o
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3
3,000
3,696
3,999
30
30,000
30,000
300
300,000
3000
301
302
303
304
3045
305
306
307
308
308
309
31
31
31
310
311
312
313
315
315
316
3166
317
318
32
32
320
321
322
322
323
325
326
328
329
33
33
33
33
330
331
332
334
335
337
338
339
34
34
34
340
341
342
344
347
348
349
35
35
350
350
3500
352
353
354
356
357
358
359
36
36
360
361
362
363
365
367
37
370
372
3'73
374
375â
376
378
379
38
38
380
381
384
386
388
39
39
391
392
394
397
398
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3â
3o
3rd
3rd
3rd
4
4
4
4
4
4
4
4
4
4
4
4
4
4
4.8
40
40
40,806
400
4000
401
402
403
406
407
41
41
410
412
414
415
416
418
419
42
421
422
423
424
426
427
427
43
43
430
431
432
433
434
435
436
437
438
439
44
45
45,000
450
455
458
4599
4599
4599
4599
4599
46
46,666
466
47
47
48
49
49
4â
4â
4h
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5
5,000
5.58
50
50
50
50
50
500
5000
51
51
52
52
5231
5231
53
53
54
55
55
56
56
56,006
566
57
57
58
58
59
59
59,999
5â
6
6
6
6
6
6
6,000
6,669
6_d
6_d
60
60
60,000
600
61
619
62
62
62
62
63
63
64
64
65
65
650
66
66
66
67
67
68
68
69
69
6â
6d
6d
6m
7
7
7
7
7
7
7,089
7.52
70
70
706
71
72
72
72
73
73
7381
74
74
75
75
750
76
77
77
78
78
79
79
7â
7â
7o
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8
8,696
80
80
800
8000
80a
81
813
82
82
824
827
83
83
84
84
844
85
85
855
86
86
86
869
87
87
88
89
89
8th
9
9
9
9
9
9
9
9
9
9
9
9
90
90
90
91
91
915
92
92
93
93
94
94
95
96
96
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
97
98
98
9â
9d
9d
9s
9th