[Home ] [Archive]   [ فارسی ]  
:: Main :: About :: Current Issue :: Archive :: Search :: Submit :: Contact ::
Main Menu
Home::
Journal Information::
Articles archive::
For Authors::
For Reviewers::
Registration::
Contact us::
Site Facilities::
::
Search in website

Advanced Search
..
Receive site information
Enter your Email in the following box to receive the site news and information.
..
:: Search published articles ::
Showing 6 results for Classification

Dr ‎farzad Eskandari‎, Ms ‎imaneh Khodayari Samghabadi‎,
Volume 21, Issue 1 (9-2016)
Abstract

‎There are different types of classification methods for classifying the certain data‎. ‎All the time the value of the variables is not certain and they may belong to the interval that is called uncertain data‎. ‎In recent years‎, ‎by assuming the distribution of the uncertain data is normal‎, ‎there are several estimation for the mean and variance of this distribution‎. ‎In this paper‎, ‎we consider the mean and variance for each of the start and end of intervals‎. ‎Thus we assume that the distribution of uncertain data is bivariate normal distribution‎. ‎We used the maximum likelihood to estimate the means and variances of the bivariate normal distribution‎. ‎Finally‎, ‎Based on the Naive Bayesian classification‎, ‎we propose a Bayesian mixture algorithm for classifying the certain and uncertain data‎. ‎The experimental results show that the proposed algorithm has high accuracy.


, , ,
Volume 22, Issue 1 (12-2017)
Abstract

‎Latent class analysis (LCA) is a method of evaluating non sampling errors‎, ‎especially measurement error in categorical data‎. ‎Biemer (2011) introduced four latent class modeling approaches‎: ‎probability model parameterization‎, ‎log linear model‎, ‎modified path model‎, ‎and graphical model using path diagrams‎. ‎These models are interchangeable‎. ‎Latent class probability models express likelihood of cross-classification tables in term of conditional and marginal probabilities for each cell‎. ‎In this approach model parameters are estimated using EM algorithm‎. ‎To test latent class model chi-square statistic is used as a measure of goodness-of-fit‎. ‎In this paper we use LCA and data from a small-scale survey to estimate misclassification error (as a measurement error) of students who had at least a failing grade as well as misclassification error of students with average grades below 14‎.


, , , ,
Volume 24, Issue 2 (3-2020)
Abstract

The Area under the ROC Curve (AUC) is a common index for evaluating the ability of the biomarkers for classification. In practice, a single biomarker has limited classification ability, so to improve the classification performance, we are interested in combining biomarkers linearly and nonlinearly. In this study, while introducing various types of loss functions, the Ramp AUC method and some of its features are introduced as a statistical model based on the AUC index. The aim of this method is to combine biomarkers in a linear or non-linear manner to improve the classification performance of the biomarkers and minimize the experimental loss function by using the Ramp AUC loss function. As an applicable example, in this study, the data of 378 diabetic patients referred to Ardabil and Tabriz Diabetes Centers in 1393-1394 have been used. RAUC method was fitted to classify diabetic patients in terms of functional limitation, based on the demographic and clinical biomarkers. Validation of the model was assessed using the training and test method. The results in the test dataset showed that the area under the RAUC curve for classification of the patients according to the functional limitation, based on the linear kernel pf biomarkers was 0.81 and with a kernel of the radial base function (RBF) was equal to 1.00. The results indicate a strong nonlinear pattern in the data so that the nonlinear combination of the biomarkers had higher classification performance than the linear combination.


Alireza Rezaee, Mojtaba Ganjali, Ehsan Bahrami,
Volume 25, Issue 1 (1-2021)
Abstract

Nonrespose is a source of error in the survey results and National statistical organizations are always looking for ways to
control and reduce it. Predicting nonrespons sampling units in the survey before conducting the survey is one of the solutions
that can help a lot in reducing and treating the survey nonresponse. Recent advances in technology and the facilitation of
complex calculations have made it possible to apply machine learning methods, such as regression and classification trees
or support vector machines, to many issues, including predicting the nonresponse of sampling units in statistics. . In this
article, while reviewing the above methods, we will predict the nonresponse sampling units in a establishment survey using
them and we will show that the combination of the above methods is more accurate in predicting the correct nonresponse
than any of the methods.

Farzad Eskandari, Sima Naghizadeh Ardebili, ,
Volume 25, Issue 2 (3-2021)
Abstract

The Internet of Things is suggested as the upcoming revolution in the Information and communication technology due to its very high capability of making various businesses and industries more productive and efficient. This productivity comes from the emergence of innovation and the introduction of new capabilities for businesses. Different industries have shown varying reactions to IOT, but what is clear is that IOT has applications in all Businesses. These applications have made significant progress in some industries such as health and transportation but is under development in others, namely agriculture and animal husbandry. In fact, the production of data bases on the Internet of Things is one of the main pillars in the field of big data and data science, Therefore, statistical concepts and models that are used in data science can be beneficially implemented in such data. Among the valid statistical models, Bayesian statistics for data is being utilized in these studies. In this research the fundamentals of Bayesian statistics for big data and most notably the data produced by IOT is explained. They have been Pragmatically examined in both road traffic as well as people’s social behavior towards using vehicles, which have had practically and scientifically valid results.
 
Zahra Ahmadian, Farzad Eskandari,
Volume 28, Issue 1 (9-2023)
Abstract

Today, the diagnosis of diseases using artificial intelligence and machine learning algorithms are of great importance, because by using the data available in the study field of the desired disease, useful information and results can be obtained that reduce the occurrence of many deaths. Among these diseases, we can mention the diagnosis of diabetes, which has spread today due to the growth of urban life and the decrease in people's activity. So, it is very important to know whether a person is suffering from diabetes or not. In this article, the data set related to the information of people who have done the diabetes diagnosis test is used, this information is related to 520 people. People are classified into two groups based on whether their diabetes test result is positive or not, and Bayesian classification methods such as Bayesian Support Vector Machine, Naive Bayes, CNK and CatBoost ensemble classification method have been used to conclude which of these The methods can have a better ability to analyze the data and also to compare these methods use accuracy, precision, F1-score, recall, ROC diagram.

Page 1 from 1     

مجله اندیشه آماری Andishe _ye Amari
Persian site map - English site map - Created in 0.04 seconds with 30 queries by YEKTAWEB 4660