P-7-29-19-1

Petrarch
Collection
Cautionaries are simply edits to the original content for the purposes of improving the usability and clarity of the informatic design.  Edits should focus on identifying the framework of the original content in its entirety, including redundant messages of cultural or legal significance.  The following edits were made to the content to improve the framework:

  1. Words were stemmed.
  2. Stop Words were used.
  • The Stop Word List:   'a', 'about', 'above', 'above', 'across', 'after', 'afterwards', 'again', 'against', 'all', 'almost', 'alone', 'along', 'already', 'also','although','always','am','among', 'amongst', 'amoungst', 'amount',  'an', 'and', 'another', 'any','anyhow','anyone','anything','anyway', 'anywhere', 'are', 'around', 'as',  'at', 'back','be','became', 'because','become','becomes', 'becoming', 'been', 'before', 'beforehand', 'behind', 'being', 'below', 'beside', 'besides', 'between', 'beyond', 'bill', 'both', 'bottom','but', 'by', 'call', 'can', 'cannot', 'cant', 'co', 'con', 'could', 'couldnt', 'cry', 'de', 'describe', 'detail', 'do', 'done', 'down', 'due', 'during', 'each', 'eg', 'eight', 'either', 'eleven','else', 'elsewhere', 'empty', 'enough', 'etc', 'even', 'ever', 'every', 'everyone', 'everything', 'everywhere', 'except', 'few', 'fifteen', 'fify', 'fill', 'find', 'fire', 'first', 'five', 'for', 'former', 'formerly', 'forty', 'found', 'four', 'from', 'front', 'full', 'further', 'get', 'give', 'go', 'had', 'has', 'hasnt', 'have', 'he', 'hence', 'her', 'here', 'hereafter', 'hereby', 'herein', 'hereupon', 'hers', 'herself', 'him', 'himself', 'his', 'how', 'however', 'hundred', 'ie', 'if', 'in', 'inc', 'indeed', 'interest', 'into', 'is', 'it', 'its', 'itself', 'keep', 'last', 'latter', 'latterly', 'least', 'less', 'ltd', 'made', 'many', 'may', 'me', 'meanwhile', 'might', 'mill', 'mine', 'more', 'moreover', 'most', 'mostly', 'move', 'much', 'must', 'my', 'myself', 'name', 'namely', 'neither', 'never', 'nevertheless', 'next', 'nine', 'no', 'nobody', 'none', 'noone', 'nor', 'not', 'nothing', 'now', 'nowhere', 'of', 'off', 'often', 'on', 'once', 'one', 'only', 'onto', 'or', 'other', 'others', 'otherwise', 'our', 'ours', 'ourselves', 'out', 'over', 'own','part', 'per', 'perhaps', 'please', 'put', 'rather', 're', 'same', 'see', 'seem', 'seemed', 'seeming', 'seems', 'serious', 'several', 'she', 'should', 'show', 'side', 'since', 'sincere', 'six', 'sixty', 'so', 'some', 'somehow', 'someone', 'something', 'sometime', 'sometimes', 'somewhere', 'still', 'such', 'system', 'take', 'ten', 'than', 'that', 'the', 'their', 'them', 'themselves', 'then', 'thence', 'there', 'thereafter', 'thereby', 'therefore', 'therein', 'thereupon', 'these', 'they', 'thick', 'thin', 'third', 'this', 'those', 'though', 'three', 'through', 'throughout', 'thru', 'thus', 'to', 'together', 'too', 'top', 'toward', 'towards', 'twelve', 'twenty', 'two', 'un', 'under', 'until', 'up', 'upon', 'us', 'very', 'via', 'was', 'we', 'well', 'were', 'what', 'whatever', 'when', 'whence', 'whenever', 'where', 'whereafter', 'whereas', 'whereby', 'wherein', 'whereupon', 'wherever', 'whether', 'which', 'while', 'whither', 'who', 'whoever', 'whole', 'whom', 'whose', 'why', 'will', 'with', 'within', 'without', 'would', 'yet', 'you', 'your', 'yours', 'yourself', 'yourselves', 'the'.

  • The Reasoning Behind the Selection - These words are of high frequency, non-unique generality.  They are simply removed to clarify the content, of a more unique terminology, during the analytic stage of modeling.  There are other words that could be included or excluded, as the method of removal isn’t intended to be exact.  However, the terms should be non-unique, of high frequency, and fully disclosed to users of the informatic model.  That is, these terms after the analytic stage are returned to the informatic model in developing the networks, layering, directionality, and detailing of the model. 

Specific Cautionaries

The following cautionaries are more specific to the Petrarch - Collection
  • There were a large variety of numbers and number-letter combinations that marked news sections. All numbers, letter-number combinations not constituting words or abbreviations were removed after the analytic modeling stage.  Some low-frequency of numbers meshing with words were removed as well.  All combinations were removed to improve the usability and clarity of the content being modeled informatically.
  • No words were removed, other than what is listed on the Stop Word list.  These words were removed only for the framing and analytic stages.  Words are returned during the network, layering, and detailing stages of modeling. 
  • Errors involving the content, such as conversion errors of words are not edited and will remain transparent to viewers of the model.  The focus is on developing trust through process and procedure, not through avenues easily manipulated, such as finely-threaded performances of perfection and cosmetic appeal.  Exceptions will be listed in the "specific edits" section.   
  • Split words that are merged back together, if any, will be listed in specific edits.
  • The userability standard is used moderately.  That is, terms like "ebook", or proper nouns, such as publisher names, or any other term reflective of the overall publication, will likely be included into the modeling process.  The models are designed to account for terms that work in different contexts, such as publication terms, that will be presented alongside the design of the actual written work, with the ideas of the given author intact.  
  • This methodology is designed to manage the unstructured informational environment, of a sound and consistent overall design, that manifests from categorical arrangements that are inconsistent and imperfect, like that of a hair style.  Even though terms, these individual hairs, will change, the overall styling, the informatic model, will remain the same.  In this way, the unstructured informational environment differs from the structured informational environment.  

 _a _ahi _alma _almo _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _anima _anima _anzi _apollo _arbor _aspro _aura _avventuroso _beato _ben _ben _benedetto _bonum _cantai _cara _cercato _cesare _che _chi _chi _chiare _come _come _come _conobbi _cos _d _da _da _datemi _de _deh _deh _del _del _dell _dell _di _di _dicemi _dicesett _discolorato _dodici _dolce _dolci _dolci _donna _due _due _e _e _e _e _e _era _erano _far _fera _fiamma _fontana _fresco _fu _fuggendo _gentil _geri _gi _gi _giovane _giunto _giunto _gli _gli _grazie _h _h _h _h _h _i _i _i _i _i _i _i _i _i _i _i _i _i _il _il _il _il _il _in _in _in _in _in _in _in _io _io _io _io _io _io _io _io _io _io _io _italia _ite _ite _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _l _la _la _la _la _la _la _la _lasciato _lasso _lasso _lasso _lasso _le _levommi _liete _lieti _linterno_ _ma _mai _mai _mente _mentre _mia _mie _mille _mille _mira _mirando _morte _movesi _n _n _n _nom _non _non _non _non _non _non _nova _now_ _o _o _o _o _o _o _o _o _o _o _occhi _occhi _occhi _ogni _on _onde _or _or _orsim_ _orso _orso _ov _ove _p _p _p _p _p _p _p _p _p _p _p _p _p _pace _padre _parr _pasco _passa _passato _passer _per _per _per _perch _perch _perch _perseguendomi _pi _pi _pi _piangete _pien _pien _piovonmi _po _poco _poi _poi _poich _pommi _poscia _qual _qual _qual _qual _qual _quand _quand _quand _quand _quando _quando _quando _quando _quando _quando _quando _quando _quando _quando _quanta _quante _quanto _quanto _quel _quel _quel _quel _quel _quel _quel _quel _quel _quella _quella _quelle _quest _questa _questa _questo _qui _qui _rapido _real _reprinted _rimansi _ripensando _rotta _s _s _s _s _s _s _s _s _s _s _s _se _se _se _se _se _se _se _se _se _se _se _sennuccio _sennuccio _sente _si _siccome _signor _solea _soleano _soleasi _solo _son _spinse _spirto _stanci _standomi _stiamo _tacer _tempo _tennemi _that _the _tornami _tra _tranquillo _tutta _tutto _una _una _vago _vague_ _valle _verdi _vergine _vergognando _vidi _vincitore _vinse _vive _voglia _voi _volo _zefiro 1 1 1 1 10 10 100 102 103 103 105 105 105 107 108 109 11 110 111 112 113 114 115 116 117 118 12 12 120 121 122 123 124 124 125 127 128 129 13 13 130 1304 1304 1305 131 132 1326 1326 1327 1327 1330 1333 1335 1339 1339 134 1341 1341 1342 1343 1344 1345 1345 1346 1346 1347 1347 1347 1347 1348 1348 1348 1349 135 1350 1350 1351 1352 1352 1352 1352 1353 1354 1354 1354 1354 1356 1357 1358 1359 1359 136 1361 1361 1362 1362 1363 1363 1366 1368 1368 137 1370 1370 1372 1373 1374 1375 139 14 14 140 141 143 145 147 148 15 151 152 154 156 157 158 16 16 160 161 163 164 165 166 167 168 169 17 17 170 172 173 174 175 176 176 1765 17650 17650 17650 17650 17650 177 1777 178 179 1795 17th 180 182 183 184 186 1861 1869 188 189 191 192 195 196 197 198 2 20 200 2006 201 2019 202 203 204 205 206 207 209 210 211 213 214 215 216 217 21st 22 221 221 222 224 226 227 228 229 230 231 232 232 233 234 235 236 239 241 242 243 244 245 246 247 248 249 250 251 252 254 255 256 257 259 26 26 260 261 263 264 265 266 266 267 268 269 27 270 271 272 273 275 276 277 278 279 28 280 281 282 283 284 285 286 287 288 289 290 291 292 293 294 295 296 297 298 299 3 3 3 30 300 301 302 303 304 305 306 307 308 309 31 31 312 313 313 314 315 316 317 319 32 320 321 322 322 322 323 324 326 327 328 329 330 331 332 333 334 337 338 339 34 340 341 342 343 344 345 346 347 348 349 35 350 353 354 355 356 357 358 359 360 361 361 362 364 365 368 37 370 371 374 375 376 377 378 379 381 382 383 384 387 390 392 393 394 396 397 399 4 4 400 400 401 402 404 406 41 411 412 413 414 417 418 43 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 437 44 45 46 47 48 49 4th _a _ahi _al _alia _alma _almo _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _amor _an _anima _anima _anzi _arbor _aspro _aura _avventuroso _ben _ben _benedetto _bonum _cara _cercato _cesare _che _che _chi _chi _chiare _come _come _conobbi _cos _da _da _datemi _deh _deh _del _del _dell _dell _di _dicemi _dicesett _discolorato _dodici _dolce _dolci _dolci _dominus _donna _due _due _e _e _e _far _fera _fiamma _fontana _francis _fresco _fu _fuggendo _gentil _geri _gi _giovane _giunto _gli _gli _gloriosa _grazie _he _i _i _i _i _i _i _i _i _il _il _il _il _il _in _in _in _in _in _in _in _io _io _io _io _io _io _io _io _italia _ite _ite _kicking _l _l _l _l _l _l _l _l _l _l _l _l _l _la _la _la _la _la _la _la _lasciato _lassare _lasso _lasso _lasso _lasso _lasso _le _levommi _lieti _linterno_ _ma _mente _mentre _mia _mia _mie _mille _mille _mira _morte _my _n _nel _nel _nella _new _non _non _non _non _non _non _non _nova _now_ _o _o _o _o _o _o _o _o _o _occhi _occhi _ogni _oim _on _onde _or _or _or _orbem _orsim_ _orso _ov _ove _p _p _p _p _padre _parr _pasco _passa _passato _passer _per _per _perch _perch _perfectly_ _perseguendomi _pi _piangete _pien _piovonmi _po _poco _poi _poich _pommi _poscia _qual _qual _qual _qual _qual _quand _quando _quando _quando _quando _quando _quando _quando _quanta _quante _quanto _quel _quel _quel _quel _quel _quel _quel _quella _quelle _quest _questa _questa _questa _questo _qui _qui _rapido _real _rimansi _rotta _s _s _s _s _s _s _se _se _se _se _se _se _se _se _se _sennuccio _severe_ _si _siccome _signor _solea _solea _soleasi _son _spinse _spirto _stiamo _tempo _tennemi _that _that_ _the _the _tra _tranquillo _tutta _una _una _vago _vague_ _valle _verdi _vergine _vergognando _vidi _vincitore _vinse _viv _vive _voglia _voi _volgendo _volo _zefiro 1 10 100 11 116th 12 13 1304 1326 1327 1335 1338 1341 1343 1344 1345 1346 1346 1347 1351 1352 1353 1356 1357 1358 1361 1363 1364 1367 1368 1370 1370 1373 1374 1375 14 15 1562 16 1637 17650 17650 17650 17650 17650 17650 17650 17650 1777 1777 1777 1777 1777 1777 1777 1795 1795 1795 1861 1869 2 2000 2006 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 2019 20th 218th 21st 221 266 3 3 313 322 322 361 371 381 394 4 400 406 47 50 50 50 6 6 63 650 650 7 7650 7th 8 8 8 8 8 8 8d 8d 8d 9 99 5 5 50 52 54 55 58 59 60 63 63 65 66 66 67 68 69 7 7 7 70 71 74 75 78 8 8 80 82 82 83 85 88 9 9 91 94 95 98 99 99 cache pg17650 www.gutenberg.org macgregor epub txt cache sonnet nott


I considered eliminating Macgregor in refrence to translation. However, the model is designed to manage all content, so I have chosen to take a more conservative position in regards to edits. Unless the content is a symbol, number, or internet address that distracts from the content, I will leave the information in. Note though, even if I chose to be less conservative in my editing, the major and minor nodes will remain consistent, in regards to the modeling of the content. There isn't necessarily a right or wrong in regards to these details, though having some disclosure of the editing process is needed, so users have option of viewing the details and noting any term of importance, in regard to their research interest, that was removed.

 Second thought, I have decided to remove Macgregor and Nott from the model to improve the userability. Again, as stated in previous posts, the finer edits of the model will be subjective, as the modeler will need to strike a balance between userability and accurately representing the content of the author. For this model, though it isn't necessary, removing Nott and Macgregor does improve the userability of the content the model is representing. For this particular model I will post an additional model displaying the nodes before these term nodes were removed.