20 research outputs found

    Spelling Checker using Algorithm Damerau Levenshtein Distance and Cosine Similarity

    Get PDF
    Writing is an embodiment of the author's ideas that are to be conveyed to others. A writer often experiences typos in typing the script, so that it can influence the meaning of the text. Therefore, a system is needed to detect word errors. In this study, checking is done by using the Dictionary Lookup method and giving the candidate words using the Damerau Levenshtein Distance algorithm. Candidates will then determine the ranking by breaking the word into Bigram form and calculating the similarity value using the Cosine Similarity algorithm. The test results based on the data used yield different Mean Reciprocal Rank (MRR) values for each type of error. The type of error deletion produces an MRR value of 88.89%, the type of insertion error produces an MRR value of 97.78%, the type of substitution error produces an MRR value of 88.89%, the type of transposition error produces an MRR value of 89

    Pencarian Tugas Akhir dengan Ontologi dan Boyer-Moore (Studi Kasus: Jurusan Teknik Informatika UNSRI)

    Get PDF
    Website sipeta.ilkom.unsri.ac.id adalah website yang menampung data tugas akhir mahasiswa Jurusan Teknik Informatika UNSRI. Namun website tersebut menggunakan penyimpanan dengan basis data biasa. Pada penelitian ini membuat pencarian data tugas akhir mahasiswa dengan memanfaatkan web semantik ontologi agar data yang dimiliki tidak hanya memiliki nilai, tetapi juga memiliki pengetahuan tentang relasi antar informasi yang saling berkaitan. Komponen yang digunakan dalam teknologi semantik adalah RDF yang dipergunakan sebagai representasi pengetahuan yang digunakan, kemudian SPARQL yang digunakan sebagai query untuk mengambil informasi yang terdapat dalam Ontologi RDF. Selain itu juga digunakan Algoritma Boyer Moore untuk mendapatkan nilai similarity antara data yang didapatkan dari hasil pencarian dengan keyword yang dimasukkan. Jenis pencarian yang dirancang ada 3 pencarian yaitu keyword search, simple search dan advanced search. Dan ketiga pencarian tersebut juga akan di kombinasikan dengan algoritma Boyer Moore.  Hasil pencarian dengan ontologi dengan pencarian dengan ontologi dan Algoritma Boyer Moore dihasilkan bahwa pencarian dengan Boyer Moore membutuhkan waktu lebih lama secara rata-rata sekitar >=0,0001 perdetik dalam 5 kali percobaan dibandingkan pencarian dengan ontologi saja. Untuk Algoritma Boyer Moore dilakukan pengujian dengan ROC didapatkan hasil akurasi sebesar 99,84% untuk 16 kali percobaan

    Text Summarization with K-Means Method

    Get PDF
    Text Summarization is a tool used to generate a short form of text that contains important information that is needed by the user automatically. In this study, Text Summarization was conducted on Indonesian news using K-Means method. The news is taken from CNN Indonesia with a free topic. K-Means is used to classify sentences that already have weight in the news with 2 clusters, namely text summaries and not text summaries. The initial centroid is selected based on the sentence with the largest value and the sentence with the smallest value. The test conducted on Indonesian news with a total 50 news and tested for feasibility using a questionnaire. K-Means was successfully summarizing the news with an average 27.3 % of original news length and gain 87% good summarize based on respondents from questionnaire

    CORRESPONDENCE ANALYSIS PADA HUBUNGAN FAKTOR-FAKTOR YANG MEMPENGARUHI PENDAPATAN PETANI KOPI PAGARALAM

    Get PDF
    Pagaralam is one of the coffee-producing districts in South Sumatra (Sum-Sel). Pagaralam coffee farming is a hereditary business, where the majority of land processing is still traditional. This is related to working capital and farmers' income. This study aims to analyze the factors that affect the income of Pagaralam coffee farmers by using correspondence analysis. There are 30 variables or factors studied. Each variable is divided into several categories. The categories of each variable are described graphically with the categories of income variable. Primary data were obtained from 196 respondents who were selected based on purposive sampling technique. There are 13 factors that affect the income of respondents, namely: number of dependents, number of trees, age of the trees, number of female workers from outside the family, frequency of fertilization, frequency of herbicide application, production of harvest, production outside of harvest, gross income, minimum price of coffee beans, the maximum price of coffee beans, economic status and land productivity. There are 8 of the 13 factors that predominantly characterize the profile of net income level of Pagaralam coffee farmers.  In general, the factor that must be considered in coffee farming is land productivity which is also related to production costs in land processing and crop production, as well as external factors regarding the market price of coffee

    Determining The Quality and Production of Fresh Vegetables Using Simple Multi - Attributes Rating Technique (SMART) - Fuzzy Tsukamoto

    Get PDF
    Vegetables are one of the most important needs in Indonesia. This is due to the increasing need for healthy food to meet daily needs. With the need for vegetables, the quality and production process are still hampered because it is done manually. Therefore created a system that can help someone determine the quality and production of the right vegetables. This system uses the SMART method and fuzzy Tsukamoto with the criteria and variables of vegetables used to get good quality and production. The SMART and fuzzy Tsukamoto method used a dataset of 20 vegetable commodities. In this study, 4 criteria and 3 variables were used, namely height, soil pH, temperature and age of harvest for quality determination. The production uses the variables of demand, supply and production

    Effect of Genetic Algorithm on Prediction of Heart Disease Stadium using Fuzzy Hierarchical Model

    Get PDF
    The Fuzzy Hierarchical Model method can be used to predict the stage of heart disease. The use of the Fuzzy Hierarchical Model on complex problems is still not optimal because it is difficult to find a fuzzy set that provides a more optimal solution. This method can be improved by changing the membership function constraints using Genetic Algorithm to get better predictions. Tests carried out using 282 heart disease patient data resulted in a Root Mean Squared Error (RMSE) value of 0.55 using the best Genetic Algorithm parameters, including population size of 140, number of generations of 125, and a combination of cross-over rate and mutation rate of 0.4 and 0.6 whereas the RMSE value generated by the Fuzzy Hierarchical Model before being optimized by the Genetic Algorithm was 0.89. These results indicate an increase in the predictive value of the Fuzzy Hierarchical Model after being optimized using the Genetic Algorithm

    Effect of N-Gram on Document Classification on the Naïve Bayes Classifier Algorithm

    Get PDF
    News has become a major need for everyone, with news we can get the information needed. News can be distributed in the form of print mass media, electronic mass media and online media. The means of spreading the news now have grown very rapidly, making the amount of information being managed are bigger and word management classified also not small.  herefore, we need a system for classifying documents that are not structured. In this study, word processing in a document is done by N-Gram as a feature generation. The document classification process is carried out using the Naïve Bayes Classifier algorithm. This study examines the effect of N-Gram on document classification on the Naïve Bayes Classifier algorithm. The results of the classification accuracy of documents by applying N-Gram is 32.68% and without applying N-Gram is 84.97%. A decrease in the classification results occurs the number of features that result from solving N-Gram that is unique or dominant to another category. The accuracy of the results obtained shows that the application of N-Gram in the classification of documents using the Naïve Bayes Classifier algorithm gives a decreased effect on the performance of the classificatio

    MEMBER ELECTION DECISION SUPPORT SYSTEM SOUTH SUMATERA PASKIBRAKA USING TOPSIS-PROMETHEE METHOD

    Get PDF
    Paskibraka is the best young generation selected through various selections to raise and lower the Heritage Flag on Indonesian Independence Day. However, in the enthusiasm of the students to take part, the Dispora of South Sumatra Province still uses a manual assessment system so that several obstacles were found in its implementation. done with Microsoft Excel, as well as a calculation system that can only be used for one period, while this selection is an annual event that is held every time to celebrate Indonesian Independence Day. Therefore we need a way that can help the Dispora of South Sumatra Province in determining the best alternative for paskibraka members. One algorithm that is useful in decision support is Topsis. Topsis is used in the application of values for each criterion and a different range of values. Then using the Promethee method can improve the Topsis method because the Promethee method is used to determine the order of priority in multi-criteria analysis. The data taken by 60 participants were then researched according to predetermined criteria including written test scores, interview tests, health tests, physical fitness, and posture. Produced the best participants according to the system as many as 15 data. The results of the research test have an accuracy of 80%

    Implementasi Metode Analytical Hierarchy Process Dan TOPSIS Dalam Sistem Pendukung Keputusan untuk Pembelian Mobil pada Rental Mobil

    Get PDF
    Car rental is a business engaged in services that provide car rental services. Car rental owners must be selective  in choosing a car to be used as a fleet, because if the car chosen is not right, the rental owner will experience a loss. To solve this kind of problem, a system to support a decision is a solution that can help rental owners according to their wants or needs. One approach that can be used in this case is the Decision Making System using the Analytical Hierarchy Process (AHP) method and the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS). The Analytical Hierarchy Process (AHP) method was used as weighting and the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) as a ranking. Testing to get a fairly good accuracy using the AHP and TOPSIS methods

    Klasifikasi Berita Berbahasa Indonesia Menggunakan Naïve Bayes Classifier

    Get PDF
    Berita pada awalnya disalurkan melalui media seperti televisi, radio dan koran, namun dengan kemajuan teknologi saat ini membuat digitalisasi informasi lebih mudah, berita berbentuk teks digital lebih cepat tersebar, aktual dan murah, sehingga dapat mengalami pelonjakan yang besar. Oleh karena itu, perlu adanya sistem yang bisa mengklasifikasikan berita secara otomatis sesuai dengan kategori-kategori berita yang ada, dengan menggunakan metode klasifikasi teks, maka kumpulan dokumen yang jumlahnya sangat besar tersebut dapat diorganisir, sehingga dapat mempermudah dan mempercepat pencarian informasi yang dibutuhkan. Dalam penelitian ini, klasifikasi teks berita menggunakan metode Naïve Bayes Classifier untuk mengklasifikasikan ke dalam empat kategori yaitu, bencana alam, kesehatan, olahraga dan pendidikan.  Pengujian dilakukan sebanyak empat kali dengan pembagian data yang berbeda-beda, dan hasil akurasi yang didapat yaitu pengujian pertama 100%, pengujian kedua 100%, pengujian ketiga 98,33% dan pengujian keempat 96,25%. Dari hasil tersebut, dapat disimpulkan bahwa hasil klasifikasi teks berita sudah baik
    corecore