Processing file ../../../../maclada/texter/herrarne2.txt Total length 5396 Processing file ../../../../maclada/texter/kallocain2.txt Total length 23625 Processing file ../../../../maclada/texter/Ling/ifw-const.txt Total length 17935 Processing file ../../../../maclada/texter/Ling/lmrisk.txt Total length 4670 Processing file ../../../../maclada/texter/Ling/cap16.txt Total length 2682 Processing file ../../../../maclada/texter/Ling/child.txt Total length 4949 Processing file ../../../../maclada/texter/Ling/pg3.txt Total length 20410 Processing file ../../../../maclada/texter/Ling/hci.txt Total length 2815 Processing file ../../../../maclada/texter/Ling/dept_info.txt Total length 6082 Processing file ../../../../maclada/texter/Ling/feature5.txt Total length 12539 Processing file ../../../../maclada/texter/Ling/carb.txt Total length 17236 Similarity score (herrarne2.txt - kallocain2.txt) : 0.529467 Similarity score (herrarne2.txt - ifw-const.txt) : 0.0078978 Similarity score (herrarne2.txt - lmrisk.txt) : 0.0101527 Similarity score (herrarne2.txt - cap16.txt) : 0.00577163 Similarity score (herrarne2.txt - child.txt) : 0.00837492 Similarity score (herrarne2.txt - pg3.txt) : 0.0122967 Similarity score (herrarne2.txt - hci.txt) : 0.00762297 Similarity score (herrarne2.txt - dept_info.txt) : 0.0134841 Similarity score (herrarne2.txt - feature5.txt) : 0.00811107 Similarity score (herrarne2.txt - carb.txt) : 0.0028388 Similarity score (kallocain2.txt - ifw-const.txt) : 0.018712 Similarity score (kallocain2.txt - lmrisk.txt) : 0.0241445 Similarity score (kallocain2.txt - cap16.txt) : 0.0129925 Similarity score (kallocain2.txt - child.txt) : 0.0212762 Similarity score (kallocain2.txt - pg3.txt) : 0.0229967 Similarity score (kallocain2.txt - hci.txt) : 0.0162316 Similarity score (kallocain2.txt - dept_info.txt) : 0.0390228 Similarity score (kallocain2.txt - feature5.txt) : 0.037102 Similarity score (kallocain2.txt - carb.txt) : 0.034815 Similarity score (ifw-const.txt - lmrisk.txt) : 0.344967 Similarity score (ifw-const.txt - cap16.txt) : 0.346909 Similarity score (ifw-const.txt - child.txt) : 0.295559 Similarity score (ifw-const.txt - pg3.txt) : 0.360342 Similarity score (ifw-const.txt - hci.txt) : 0.213533 Similarity score (ifw-const.txt - dept_info.txt) : 0.227037 Similarity score (ifw-const.txt - feature5.txt) : 0.274859 Similarity score (ifw-const.txt - carb.txt) : 0.0911294 Similarity score (lmrisk.txt - cap16.txt) : 0.327808 Similarity score (lmrisk.txt - child.txt) : 0.316054 Similarity score (lmrisk.txt - pg3.txt) : 0.485553 Similarity score (lmrisk.txt - hci.txt) : 0.198422 Similarity score (lmrisk.txt - dept_info.txt) : 0.283738 Similarity score (lmrisk.txt - feature5.txt) : 0.29587 Similarity score (lmrisk.txt - carb.txt) : 0.190379 Similarity score (cap16.txt - child.txt) : 0.266467 Similarity score (cap16.txt - pg3.txt) : 0.36303 Similarity score (cap16.txt - hci.txt) : 0.20474 Similarity score (cap16.txt - dept_info.txt) : 0.160089 Similarity score (cap16.txt - feature5.txt) : 0.26399 Similarity score (cap16.txt - carb.txt) : 0.0232856 Similarity score (child.txt - pg3.txt) : 0.395803 Similarity score (child.txt - hci.txt) : 0.175748 Similarity score (child.txt - dept_info.txt) : 0.141437 Similarity score (child.txt - feature5.txt) : 0.159 Similarity score (child.txt - carb.txt) : 0.0221769 Similarity score (pg3.txt - hci.txt) : 0.223388 Similarity score (pg3.txt - dept_info.txt) : 0.223421 Similarity score (pg3.txt - feature5.txt) : 0.286161 Similarity score (pg3.txt - carb.txt) : 0.0754652 Similarity score (hci.txt - dept_info.txt) : 0.113987 Similarity score (hci.txt - feature5.txt) : 0.150379 Similarity score (hci.txt - carb.txt) : 0.0255341 Similarity score (dept_info.txt - feature5.txt) : 0.697848 Similarity score (dept_info.txt - carb.txt) : 0.829749 Similarity score (feature5.txt - carb.txt) : 0.700334 Number of clusters: 2 (i,j) S[i,j] (1,0) 0.529467 (i,j) S[i,j] (2,0) 0.0078978 (i,j) S[i,j] (2,1) 0.018712 No match for this text has been found! New cluster built! (i,j) S[i,j] (3,0) 0.0101527 (i,j) S[i,j] (3,1) 0.0241445 (i,j) S[i,j] (3,2) 0.344967 (i,j) S[i,j] (4,0) 0.00577163 (i,j) S[i,j] (4,1) 0.0129925 (i,j) S[i,j] (4,2) 0.346909 (i,j) S[i,j] (5,0) 0.00837492 (i,j) S[i,j] (5,1) 0.0212762 (i,j) S[i,j] (5,2) 0.295559 (i,j) S[i,j] (6,0) 0.0122967 (i,j) S[i,j] (6,1) 0.0229967 (i,j) S[i,j] (6,2) 0.360342 (i,j) S[i,j] (7,0) 0.00762297 (i,j) S[i,j] (7,1) 0.0162316 (i,j) S[i,j] (7,2) 0.213533 (i,j) S[i,j] (8,0) 0.0134841 (i,j) S[i,j] (8,1) 0.0390228 (i,j) S[i,j] (8,2) 0.227037 (i,j) S[i,j] (9,0) 0.00811107 (i,j) S[i,j] (9,1) 0.037102 (i,j) S[i,j] (9,2) 0.274859 (i,j) S[i,j] (10,0) 0.0028388 (i,j) S[i,j] (10,1) 0.034815 (i,j) S[i,j] (10,2) 0.0911294 Number of clusters: 3 old_size=1 Cluster no 1 Cluster new cluster contains the following texts (Similarity with centroid) 0 kallocain2.txt (0.833975) 1 herrarne2.txt (0.909634) Cluster no 1 Please name this cluster: Swedish Cluster no 2 Cluster new cluster contains the following texts (Similarity with centroid) 0 pg3.txt (0.42831) 1 lmrisk.txt (0.490398) 2 cap16.txt (0.37898) 3 child.txt (0.329206) 4 ifw-const.txt (0.441029) 5 hci.txt (0.334048) 6 dept_info.txt (0.854554) 7 feature5.txt (0.810752) 8 carb.txt (0.833728) Cluster no 2 Please name this cluster: English Similarity between clusters Swedish-English : 0.0289046 Please enter the number of the cluster you want to check out more in detail. (Enter -1 to exit program.) 2 Zooming in! Working on text 0 in the cluster English. Working on text 1 in the cluster English. Working on text 2 in the cluster English. Working on text 3 in the cluster English. Working on text 4 in the cluster English. Working on text 5 in the cluster English. Working on text 6 in the cluster English. Working on text 7 in the cluster English. Working on text 8 in the cluster English. Similarity score (pg3.txt - lmrisk.txt) : 0.607513 Similarity score (pg3.txt - cap16.txt) : 0.474047 Similarity score (pg3.txt - child.txt) : 0.604908 Similarity score (pg3.txt - ifw-const.txt) : 0.400583 Similarity score (pg3.txt - hci.txt) : 0.291326 Similarity score (pg3.txt - dept_info.txt) : -0.451412 Similarity score (pg3.txt - feature5.txt) : -0.126671 Similarity score (pg3.txt - carb.txt) : -0.787194 Similarity score (lmrisk.txt - cap16.txt) : 0.39792 Similarity score (lmrisk.txt - child.txt) : 0.489138 Similarity score (lmrisk.txt - ifw-const.txt) : 0.343062 Similarity score (lmrisk.txt - hci.txt) : 0.224581 Similarity score (lmrisk.txt - dept_info.txt) : -0.435066 Similarity score (lmrisk.txt - feature5.txt) : -0.198075 Similarity score (lmrisk.txt - carb.txt) : -0.681878 Similarity score (cap16.txt - child.txt) : 0.406769 Similarity score (cap16.txt - ifw-const.txt) : 0.352303 Similarity score (cap16.txt - hci.txt) : 0.228125 Similarity score (cap16.txt - dept_info.txt) : -0.447781 Similarity score (cap16.txt - feature5.txt) : -0.0996387 Similarity score (cap16.txt - carb.txt) : -0.684009 Similarity score (child.txt - ifw-const.txt) : 0.365987 Similarity score (child.txt - hci.txt) : 0.263719 Similarity score (child.txt - dept_info.txt) : -0.432636 Similarity score (child.txt - feature5.txt) : -0.190047 Similarity score (child.txt - carb.txt) : -0.724692 Similarity score (ifw-const.txt - hci.txt) : 0.192186 Similarity score (ifw-const.txt - dept_info.txt) : -0.412368 Similarity score (ifw-const.txt - feature5.txt) : -0.169086 Similarity score (ifw-const.txt - carb.txt) : -0.623458 Similarity score (hci.txt - dept_info.txt) : -0.427683 Similarity score (hci.txt - feature5.txt) : -0.226155 Similarity score (hci.txt - carb.txt) : -0.551283 Similarity score (dept_info.txt - feature5.txt) : 0.0376772 Similarity score (dept_info.txt - carb.txt) : 0.520132 Similarity score (feature5.txt - carb.txt) : 0.0966352 Number of clusters: 4 (i,j) S[i,j] (1,0) 0.607513 (i,j) S[i,j] (2,0) 0.474047 (i,j) S[i,j] (3,0) 0.604908 (i,j) S[i,j] (4,0) 0.400583 (i,j) S[i,j] (5,0) 0.291326 (i,j) S[i,j] (6,0) -0.451412 (i,j) S[i,j] (6,1) -0.435066 (i,j) S[i,j] (6,2) -0.447781 (i,j) S[i,j] (6,3) -0.432636 (i,j) S[i,j] (6,4) -0.412368 (i,j) S[i,j] (6,5) -0.427683 No match for this text has been found! New cluster built! (i,j) S[i,j] (7,0) -0.126671 (i,j) S[i,j] (7,1) -0.198075 (i,j) S[i,j] (7,2) -0.0996387 (i,j) S[i,j] (7,3) -0.190047 (i,j) S[i,j] (7,4) -0.169086 (i,j) S[i,j] (7,5) -0.226155 (i,j) S[i,j] (7,6) 0.0376772 No match for this text has been found! New cluster built! (i,j) S[i,j] (8,0) -0.787194 (i,j) S[i,j] (8,1) -0.681878 (i,j) S[i,j] (8,2) -0.684009 (i,j) S[i,j] (8,3) -0.724692 (i,j) S[i,j] (8,4) -0.623458 (i,j) S[i,j] (8,5) -0.551283 (i,j) S[i,j] (8,6) 0.520132 Number of clusters: 6 old_size=3 Cluster no 3 Cluster new cluster contains the following texts (Similarity with centroid) 0 pg3.txt (0.783694) 1 lmrisk.txt (0.700848) 2 cap16.txt (0.689229) 3 child.txt (0.736262) 4 ifw-const.txt (0.641486) 5 hci.txt (0.590752) Cluster no 3 Please name this cluster: a Cluster no 4 Cluster new cluster contains the following texts (Similarity with centroid) 0 carb.txt (0.987593) 1 dept_info.txt (0.647799) Cluster no 4 Please name this cluster: b Cluster no 5 Cluster new cluster contains the following texts (Similarity with centroid) 0 feature5.txt (1) Cluster no 5 Please name this cluster: Similarity between clusters a-b : -0.987247 a-FieldStaffInformation : -0.250441 b-FieldStaffInformation : 0.0931222 Please enter the number of the cluster you want to check out more in detail. (Enter -1 to exit program.) 3 Zooming in! Working on text 0 in the cluster a. Working on text 1 in the cluster a. Working on text 2 in the cluster a. Working on text 3 in the cluster a. Working on text 4 in the cluster a. Working on text 5 in the cluster a. Similarity score (pg3.txt - lmrisk.txt) : 0.137866 Similarity score (pg3.txt - cap16.txt) : -0.148964 Similarity score (pg3.txt - child.txt) : 0.0647079 Similarity score (pg3.txt - ifw-const.txt) : -0.213079 Similarity score (pg3.txt - hci.txt) : -0.344978 Similarity score (lmrisk.txt - cap16.txt) : -0.170385 Similarity score (lmrisk.txt - child.txt) : -0.0600722 Similarity score (lmrisk.txt - ifw-const.txt) : -0.188633 Similarity score (lmrisk.txt - hci.txt) : -0.336964 Similarity score (cap16.txt - child.txt) : -0.20348 Similarity score (cap16.txt - ifw-const.txt) : -0.162343 Similarity score (cap16.txt - hci.txt) : -0.30094 Similarity score (child.txt - ifw-const.txt) : -0.205267 Similarity score (child.txt - hci.txt) : -0.309916 Similarity score (ifw-const.txt - hci.txt) : -0.30228 Number of clusters: 7 (i,j) S[i,j] (1,0) 0.137866 (i,j) S[i,j] (2,0) -0.148964 (i,j) S[i,j] (2,1) -0.170385 No match for this text has been found! New cluster built! (i,j) S[i,j] (3,0) 0.0647079 (i,j) S[i,j] (3,1) -0.0600722 (i,j) S[i,j] (3,2) -0.20348 No match for this text has been found! New cluster built! (i,j) S[i,j] (4,0) -0.213079 (i,j) S[i,j] (4,1) -0.188633 (i,j) S[i,j] (4,2) -0.162343 (i,j) S[i,j] (4,3) -0.205267 No match for this text has been found! New cluster built! (i,j) S[i,j] (5,0) -0.344978 (i,j) S[i,j] (5,1) -0.336964 (i,j) S[i,j] (5,2) -0.30094 (i,j) S[i,j] (5,3) -0.309916 (i,j) S[i,j] (5,4) -0.30228 No match for this text has been found! New cluster built! Number of clusters: 11 old_size=6 Cluster no 6 Cluster new cluster contains the following texts (Similarity with centroid) 0 pg3.txt (0.71218) 1 lmrisk.txt (0.793479) Cluster no 6 Please name this cluster: WorkConditions Cluster no 7 Cluster new cluster contains the following texts (Similarity with centroid) 0 cap16.txt (1) Cluster no 7 Please name this cluster: LibraryWorkers Cluster no 8 Cluster new cluster contains the following texts (Similarity with centroid) 0 child.txt (1) Cluster no 8 Please name this cluster: ChildWorkers Cluster no 9 Cluster new cluster contains the following texts (Similarity with centroid) 0 ifw-const.txt (1) Cluster no 9 Please name this cluster: Constitution Cluster no 10 Cluster new cluster contains the following texts (Similarity with centroid) 0 hci.txt (1) Cluster no 10 Please name this cluster: ComputerResearchers Similarity between clusters HardWorking-LibraryWorkers : -0.212296 HardWorking-ChildWorkers : -0.00281625 HardWorking-WorkerEducation : -0.264626 HardWorking-ComputerResearchers : -0.450806 LibraryWorkers-ChildWorkers : -0.20348 LibraryWorkers-WorkerEducation : -0.162343 LibraryWorkers-ComputerResearchers : -0.30094 ChildWorkers-WorkerEducation : -0.205267 ChildWorkers-ComputerResearchers : -0.309916 WorkerEducation-ComputerResearchers : -0.30228 Please enter the number of the cluster you want to check out more in detail. (Enter -1 to exit program.) Zooming in! Working on text 0 in the cluster HardWorking. Working on text 1 in the cluster HardWorking. Similarity score (pg3.txt - lmrisk.txt) : -1 Number of clusters: 12 (i,j) S[i,j] (1,0) -1 No match for this text has been found! New cluster built! Number of clusters: 13 old_size=11 Cluster no 11 Cluster new cluster contains the following texts (Similarity with centroid) 0 pg3.txt (1) Cluster no 11 Please name this cluster: Farmworkers Cluster no 12 Cluster new cluster contains the following texts (Similarity with centroid) 0 lmrisk.txt (1) Cluster no 12 Please name this cluster: LabourLegislation Similarity between clusters Farmworkers-LabourLegislation : -1 Please enter the number of the cluster you want to check out more in detail. (Enter -1 to exit program.)