:: Search published articles ::

Zahra Ahmadian, Farzad Eskandari,
Volume 28, Issue 1 (9-2023)
Abstract

Today, diagnosing diseases with artificial intelligence and machine learning algorithms is of great importance, because useful information and results can be extracted from the data available in the study of a given disease, helping to prevent many deaths. Among these diseases is diabetes, which has become widespread owing to the growth of urban life and the decline in people's physical activity, so determining whether a person suffers from diabetes is very important. This article uses a data set containing the records of 520 people who underwent a diabetes diagnosis test. People are classified into two groups according to whether their diabetes test result is positive or negative, and Bayesian classification methods such as the Bayesian support vector machine, naive Bayes, CNK and the CatBoost ensemble classification method are applied to determine which of these methods analyzes the data best; accuracy, precision, F1-score, recall and the ROC curve are used to compare them.
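As a minimal illustration of the comparison criteria named above (accuracy, precision, recall, F1-score and the ROC curve), the following Python sketch computes them for a generic binary classifier on synthetic data; the data set, model choice and variable names are placeholders, not the authors' actual pipeline.

```python
# Illustrative sketch only: the evaluation metrics named in the abstract,
# computed for a generic binary classifier on synthetic data.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.metrics import (accuracy_score, precision_score,
                             recall_score, f1_score, roc_auc_score)

# Synthetic stand-in for the 520-person diabetes data set.
X, y = make_classification(n_samples=520, n_features=16, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

model = GaussianNB().fit(X_train, y_train)   # one of the compared methods
y_pred = model.predict(X_test)
y_prob = model.predict_proba(X_test)[:, 1]   # scores for the ROC curve

print("accuracy :", accuracy_score(y_test, y_pred))
print("precision:", precision_score(y_test, y_pred))
print("recall   :", recall_score(y_test, y_pred))
print("F1-score :", f1_score(y_test, y_pred))
print("ROC AUC  :", roc_auc_score(y_test, y_prob))
```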
Hossein Samimi Haghgozar, Anahita Nazarizadeh,
Volume 28, Issue 1 (9-2023)
Abstract

Risk refers to a situation in which there is a possibility of deviation from a predicted result. Insurance is one method of dealing with risk, transferring all or part of the risk from the insured to the insurer. Insurance policies are usually classified into two types: personal and non-life insurance. Under this classification, any policy that deals with the health of the insured person or persons is a personal insurance policy; otherwise it is a non-life insurance policy. Many insurances in the insurance industry fall into the non-life class; fire, automobile, engineering, shipping, and oil and energy insurance are examples. Three quantities in risk models are especially important to explain and calculate: the ruin probability, the time of ruin and the severity of ruin. In this article, the main well-known results obtained so far in non-life insurance, with emphasis on the ruin probability, are presented together with various examples.
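For readers unfamiliar with the three quantities listed above, a standard formulation (the classical Cramér-Lundberg surplus process, stated here as background rather than as this article's specific model) is

$$U(t) = u + ct - \sum_{i=1}^{N(t)} X_i, \qquad \psi(u) = \Pr\Bigl(\inf_{t \ge 0} U(t) < 0 \,\Big|\, U(0) = u\Bigr),$$

where $u$ is the initial capital, $c$ the premium rate, $N(t)$ a Poisson claim-counting process and $X_i$ the claim sizes; $\psi(u)$ is the (infinite-time) ruin probability, the time of ruin is $T = \inf\{t : U(t) < 0\}$ and the severity of ruin is $|U(T)|$.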
Anahita Komeijani, Ebrahim Reyhani, Zahra Rahimi, Ehsan Bahrami Samani,
Volume 28, Issue 1 (9-2023)
Abstract

The importance of statistics and statistics education in today's world, which is full of information and data, is hidden from no one. Statistical thinking is the core of a correct understanding of statistical concepts, of data analysis and of the interpretation of phenomena. With the aim of reaching a comprehensive definition of statistical thinking and determining its elements, the present research reviewed the studies of the last thirty years. This descriptive research was carried out with a qualitative meta-synthesis method to provide an insight into the totality of existing studies. Based on the inclusion criteria, 123 studies published between 1990 and 2022 were identified and, after screening, 22 were selected for detailed review and analysis. According to the present meta-synthesis findings, the elements of statistical thinking are: 1) being data-oriented: paying attention to data, identifying the need for data, collecting and considering data, and using different representations of data and methods of converting them to each other; 2) variability: considering permanent change in all phenomena; 3) statistical inference: attending to types of sampling, reasoning and inference using statistical models, including the use of statistical charts and the generalization of results from the sample to the population; and 4) analysis of the statistical context: combining the statistical problem with its context.
Nasrin Akhoundi, Gh. Moshirian, S. Hatami,
Volume 28, Issue 1 (9-2023)
Abstract

This study applied the theory of planned behavior to entrepreneurial tendencies and examined the effect of this tendency on the development of information technology among Iranian youth aged 18-30 in the winter of 1401 (Solar Hijri). One part of the sample was drawn from the 18-30 age group registered with mobile operators in Tehran province, who received the questionnaire completely at random through an SMS system carrying the link address; another part consisted of students aged 18-30 at the Islamic Azad University, South Tehran Branch, to whom the research questionnaire was given directly. The validity of the questionnaire was confirmed by experts in ICT, and its reliability was established by Cronbach's alpha test with an alpha coefficient of at least 0.70 (the criterion value). The results showed that, in line with the theory of planned behavior, entrepreneurial tendency affects information technology development among Iranian youth aged 18-30.
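For reference, the reliability criterion mentioned above is the standard Cronbach's alpha statistic (given here as background; the abstract does not spell out the formula):

$$\alpha = \frac{k}{k-1}\left(1 - \frac{\sum_{i=1}^{k}\sigma_{i}^{2}}{\sigma_{T}^{2}}\right),$$

where $k$ is the number of questionnaire items, $\sigma_{i}^{2}$ the variance of item $i$ and $\sigma_{T}^{2}$ the variance of the total score; values of at least 0.70 are conventionally taken as acceptable reliability.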
Shahram Yaghoobzadeh Shahrastani, Amrollah Jafari,
Volume 28, Issue 1 (9-2023)
Abstract

In this article, the $M/M/1$ queuing model is considered, in which the interarrival times of customers have an exponential distribution with parameter $\lambda$, and the service times have an exponential distribution with parameter $\mu$ and are independent of the interarrival times. It is also assumed that the system is active until time $T$. Under this stopping time, the Bayesian, $E$-Bayesian and hierarchical Bayesian estimations of the traffic intensity parameter of this queuing model are obtained under the general entropy loss function, with gamma and Erlang prior distributions for the parameters $\lambda$ and $\mu$, respectively. Then, using numerical analysis and based on a new index, the Bayesian, $E$-Bayesian and hierarchical Bayesian estimations are compared.
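As background for the terms used above (stated from the standard Bayesian queuing literature, not taken from the article itself), the traffic intensity and the general entropy (GE) loss function with its Bayes estimator are commonly written as

$$\rho = \frac{\lambda}{\mu}, \qquad L(\hat{\rho},\rho) \propto \Bigl(\frac{\hat{\rho}}{\rho}\Bigr)^{q} - q\,\ln\frac{\hat{\rho}}{\rho} - 1, \qquad \hat{\rho}_{\mathrm{GE}} = \bigl[\,\mathbb{E}\bigl(\rho^{-q}\mid \text{data}\bigr)\,\bigr]^{-1/q},$$

where $q \neq 0$ controls the asymmetry of the loss and the posterior expectation is taken under the stated gamma and Erlang priors.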


Mrs Parya Torabi Kahlan, Mrs Lida Kalhori Nadrabadi,
Volume 28, Issue 1 (9-2023)
Abstract

The use of administrative sources in censuses makes it possible to reduce costs, improve data quality and produce information at shorter time intervals. Besides supporting the annual monitoring of the sustainable development goal indicators, these benefits can also play a significant role in meeting the growing needs of the country's planning and research system. There are, however, many challenges, one of the most important being the assessment of the quality of administrative data. Quality assessment is the most important aspect of using registration and administrative data in the census, and it is a necessity of register-based population and housing censuses, where traditional quality assessment criteria cannot be used. In other words, despite the advantages of using register and administrative data in the census, there are many key quality risks that need to be addressed and assessed before such data are used. This paper reviews the methods by which national statistics offices can evaluate the quality of data obtained from registers with the aim of producing high-quality statistical outputs. The tools and key indicators used to quantify quality assessment in each of the four stages of the census process, namely source, input data, process and output, are introduced.


Dr. Rahim Mahmoudvand,
Volume 28, Issue 1 (9-2023)
Abstract

Actuarial studies treat insurance losses as random variables, and appropriate probabilistic models are sought for them. Since losses are evaluated in monetary amounts, distributions with positive support are typically used to model them. In practice, however, losses are often bounded by the policy conditions, and this must be taken into account when modeling. While this poses no problem in the univariate case, it becomes complicated in multivariate cases. Copulas can be helpful in such situations, but studying the correlation is a crucial first step. This paper therefore investigates the effect of restricted losses on correlation in multivariate cases.
The Pearson correlation coefficient is a widely used measure of linear correlation between variables. In this study, we examine the correlation between two random variables and investigate the estimator of the correlation coefficient. Furthermore, we analyze a real-world dataset from an Iranian insurance company, including losses due to physical damage and bodily injury as covered by third-party liability insurance.
Upper and lower limits for both the Pearson correlation coefficient and its estimator were derived. The Copula method was employed to obtain the bounds for the correlation parameters, while order statistics were used to obtain the bounds for the sample correlation coefficient. Furthermore, two methods were used to determine the correlation between physical damage and bodily injury, and the results were compared.
Our findings suggest that the commonly used upper and lower bounds of -1 and +1 for the Pearson correlation coefficient may not always apply to insurance losses. Instead, our analysis reveals that narrower bounds can be established for this measure in such cases. The results of this study provide important insights into modeling insurance losses in multivariate cases and have practical implications for risk management and pricing decisions in the insurance industry.
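The bounds discussed above rest on two standard facts (background, not this paper's derivation): Hoeffding's covariance identity and the Fréchet-Hoeffding bounds on a copula,

$$\operatorname{Cov}(X,Y) = \int\!\!\int \bigl[\,C\bigl(F(x),G(y)\bigr) - F(x)\,G(y)\,\bigr]\,dx\,dy, \qquad W(u,v) \le C(u,v) \le M(u,v),$$

with $W(u,v) = \max(u+v-1,\,0)$ and $M(u,v) = \min(u,v)$; substituting $W$ and $M$ gives the minimal and maximal attainable correlations, which, depending on the marginals, can lie strictly inside $[-1, 1]$.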
 
Manije Sanei Tabas, Mohammadhosein Dehghan, Fatemeh Ashtab,
Volume 28, Issue 1 (9-2023)
Abstract

Variance and entropy are distinct metrics commonly used to measure the uncertainty of a random variable. While the variance shows how much a random variable spreads around its expectation, entropy measures uncertainty in the information-theoretic sense; in other words, it measures the average amount of information carried by a random variable.
For both the uniform and the normal distribution, the variance is a constant multiple of the entropy power. Finding such a monotonic relationship between variance and entropy for a larger class of distributions is very important and useful in signal processing, machine learning, information theory, probability and statistics. For example, it is used to reduce the errors of estimators and to choose a strategy that gives, on average, the greatest (or nearly the greatest) reduction in the entropy of the distribution of a target location; the effectiveness of this method has been tested in simulations with mining assay models. In this article, an upper bound on the variance of unimodal distributions whose tails are heavier than exponential tails is derived with the help of the entropy power.
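As background for the relationship just described (standard definitions, not the article's new bound), the differential entropy and the entropy power of a continuous random variable $X$ with density $f$ are

$$h(X) = -\int f(x)\,\ln f(x)\,dx, \qquad N(X) = \frac{1}{2\pi e}\,e^{2h(X)},$$

so that for the normal distribution $\mathrm{Var}(X) = N(X)$ exactly, while for the uniform distribution on an interval $\mathrm{Var}(X) = (\pi e/6)\,N(X)$; in both cases the variance is a fixed multiple of the entropy power.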

Abdolsaeed Toomaj,
Volume 28, Issue 1 (9-2023)
Abstract

This paper explores some extropy properties of the lifetimes of coherent systems under the assumption that the component lifetimes are independent and identically distributed. The results are obtained using the concept of the system signature. To this end, we first provide an expression for the extropy of a coherent system's lifetime. Then, stochastic comparisons of extropy are discussed for coherent systems under the condition that both systems share the same structure. When the number of components is large or the system has a complex structure, obtaining the exact extropy of the system lifetime is difficult or time-consuming; therefore, bounds on the extropy are also obtained. In addition, a new criterion for selecting a preferred system based on relative extropy is proposed, which favors the system whose lifetime is closest to that of the parallel system.
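Two standard definitions underlie the abstract (background only): the extropy of a continuous lifetime $X$ with density $f$, and the signature representation of a coherent system's lifetime $T$ with i.i.d. component lifetimes,

$$J(X) = -\frac{1}{2}\int_{0}^{\infty} f^{2}(x)\,dx, \qquad f_{T}(t) = \sum_{i=1}^{n} s_{i}\, f_{i:n}(t),$$

where $s = (s_1,\dots,s_n)$ is the system signature, $s_i$ being the probability that the $i$-th component failure destroys the system, and $f_{i:n}$ the density of the $i$-th order statistic of the component lifetimes.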


Dr. Akram Kohansal, Mrs. Atefeh Karami,
Volume 28, Issue 1 (9-2023)
Abstract

Statistical inference for the multi-component stress-strength parameter, $R_{s,k}$, is considered for the three-parameter Weibull distribution. The problem is studied in two cases. In the first case, assuming that the stress and strength variables have common shape and location parameters, non-common scale parameters, and that all of these parameters are unknown, the maximum likelihood estimate and the Bayesian estimate of $R_{s,k}$ are investigated. As the Bayesian estimate has no closed form in this case, it is approximated by two methods, Lindley's approximation and MCMC; asymptotic confidence intervals are also obtained. In the second case, assuming that the stress and strength variables have known common shape and location parameters and non-common, unknown scale parameters, the maximum likelihood estimate, the uniformly minimum variance unbiased estimator, the exact Bayesian estimate of $R_{s,k}$ and the asymptotic confidence interval are calculated. Finally, the performance of the different estimators is compared using Monte Carlo simulation.
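For context, the multi-component stress-strength reliability in the usual $s$-out-of-$k$ formulation (the classical definition of Bhattacharyya and Johnson, not specific to this article) is

$$R_{s,k} = \Pr\bigl(\text{at least } s \text{ of } Y_1,\dots,Y_k \text{ exceed } X\bigr) = \sum_{i=s}^{k} \binom{k}{i} \int_{-\infty}^{\infty} \bigl[1-F_Y(x)\bigr]^{i}\,\bigl[F_Y(x)\bigr]^{k-i}\, dF_X(x),$$

where $F_Y$ and $F_X$ are the strength and stress distribution functions; here both are three-parameter Weibull.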


Dr. Nahid Sanjari Farsipour, Dr. Bahram Tarami, Mrs Zahra Memar Kashani,
Volume 28, Issue 2 (3-2024)
Abstract

Marshall and Olkin introduced a family of distributions obtained by adding a parameter to other distributions, and Santos-Neto et al. studied an extended Weibull distribution. In this paper, two extended Weibull distributions, the Rayleigh and Pareto cases, are studied through their moments and by Bayesian methods under several loss functions, such as the squared error, entropy, LINEX, squared error in logarithm and modified LINEX losses. The MCMC method is also studied for these two distributions.
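As background, the standard Marshall-Olkin construction (stated here in general form rather than as the paper's specific parametrization) adds a tilt parameter $\alpha > 0$ to a baseline survival function $\bar{F}$ via

$$\bar{G}(x;\alpha) = \frac{\alpha\,\bar{F}(x)}{1-(1-\alpha)\,\bar{F}(x)}, \qquad \alpha > 0,$$

which reduces to the baseline at $\alpha = 1$; taking $\bar{F}$ to be an extended Weibull survival function yields families of the kind studied here.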
Dr Fatemeh Shahsanaei, Dr Rahim Chinipardaz,
Volume 28, Issue 2 (3-2024)
Abstract

Circular data are measured as angles or directions. In many sampling situations, instead of a random sample we deal with a weighted model: observations arise through a positive weight function. This article deals with weighted distributions for circular data. Since the von Mises distribution is the most widely used distribution for modeling circular data, maximum likelihood estimation of the parameters of weighted von Mises distributions is investigated. In a simulation study, different weights are compared for the von Mises circular distribution.
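Two standard definitions frame the problem (background, not the article's estimators): the von Mises density and the generic weighted-distribution construction,

$$f(\theta;\mu,\kappa) = \frac{e^{\kappa\cos(\theta-\mu)}}{2\pi I_{0}(\kappa)}, \qquad f_{w}(\theta) = \frac{w(\theta)\, f(\theta)}{\mathbb{E}\,[\,w(\Theta)\,]},$$

where $\mu$ is the mean direction, $\kappa \ge 0$ the concentration, $I_0$ the modified Bessel function of the first kind of order zero, and $w(\cdot)$ a positive weight function.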
Mrs Parya Torabi Kahlan, Mrs Lida Kalhori Nadrabadi,
Volume 28, Issue 2 (3-2024)
Abstract

Over the last decades there have been increasing challenges to the traditional census. Collecting information from every person in a country using traditional methods is a massive and costly exercise and is thus a key concern. Further, reduced willingness among the population to respond to census questionnaires, and unexpected crises such as the Covid-19 pandemic, have made it increasingly difficult for NSOs to produce reliable figures with the necessary geographical and substantive detail. But developments in new technologies and approaches to data collection mean that there are also emerging opportunities. The increasing desire to use administrative sources in the implementation of censuses has made it possible to reduce costs, improve data quality and produce frequent information on an annual basis. A study of the different approaches adopted by some countries in the Asia-Pacific region shows that administrative data are currently being used in different ways to support census operations. Examining these approaches will be very useful in guiding countries contemplating the use, or the expansion of the use, of administrative data for censuses. In this article, alongside a review of the definition of a register and the types of administrative registers used in register-based censuses, the steps taken by some countries in moving towards a register-based census are presented.
 
Dr. Abouzar Bazyari,
Volume 28, Issue 2 (3-2024)
Abstract

Insurance companies are modeled with mathematical and statistical models that reflect their random structure. In this paper, the individual risk model of an insurance company with different interest rates over a period of time is considered, and it is assumed that the interest rates follow a probability transition matrix with a finite, countable state space. The finite-time and infinite-time ruin probabilities are computed by conditioning on the density function of the first claim. Moreover, upper bounds for the infinite-time ruin probability are obtained using mathematical induction. In the numerical examples, the ruin probabilities for heavy-tailed distributions are compared with the probabilities obtained in Bazyari (2022) for the classical individual risk model, and the infinite-time ruin probabilities for light-tailed distributions are compared with Lundberg's inequality. The results show that the existence of an interest rate governed by a probability transition matrix with a finite state space decreases the ruin probabilities.
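The benchmark mentioned above is the classical Lundberg inequality (background; the paper's own bounds refine the setting with interest rates):

$$\psi(u) \le e^{-Ru}, \qquad u \ge 0,$$

where $u$ is the initial surplus and $R > 0$ is the adjustment (Lundberg) coefficient, the positive solution of $M_X(R) = 1 + (1+\theta)\mu R$ in the classical light-tailed model, with $M_X$ the claim-size moment generating function, $\mu$ the mean claim and $\theta$ the safety loading.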
Dr. Reza Zarei, Dr. Shahram Yaghoubzadeh Shahrestani, Dr. Amrollah Jafari,
Volume 28, Issue 2 (3-2024)
Abstract

The cost function and the system's stationary probability are two key criteria in the design of queuing systems. In this paper, the aim is to design single-server queuing models with infinite capacity, where the service times in the first model and the interarrival times in the second model are assumed to have an Erlang distribution. For this purpose, a new index based on the cost function and the system's reliability probability is introduced, larger values of which indicate the optimality of the model. Several numerical examples and an applied example are presented to explain the computational details of the proposed method.


Mrs Parya Torabi Kahlan, Mr Alireza Zahedian,
Volume 28, Issue 2 (3-2024)
Abstract

Small and medium enterprises play a significant role in creating job opportunities, in gross domestic product, and in increasing the production and export of domestic products, and they are considered one of the key pillars of economic growth and development. The purpose of this article is to explain the position of SMEs and their contribution to employment, the economy, exports and access to banking resources in selected countries, and to compare them with the data available for Iran. To this end, the total number of micro and small establishments in the country's private sector (fewer than 50 employees) and the number of their employed persons are estimated using labour force survey data over a 10-year period, and their values are predicted for 1404 (Solar Hijri) using the double exponential smoothing method. The results indicate that the number of private-sector establishments with fewer than 50 employees increased during the period under review and will reach 4,596,855 by 1404, with the share of persons employed in these establishments predicted to be 82.6 percent in 1404. However, the evidence shows that the share of these enterprises in bank facilities over the past years was only 4.5 percent, which is very low compared with other countries. The findings of this research are helpful for government support programs for SMEs.
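The forecasting method named above, double (Holt's) exponential smoothing, follows the standard recursions $l_t = \alpha y_t + (1-\alpha)(l_{t-1}+b_{t-1})$ and $b_t = \beta(l_t - l_{t-1}) + (1-\beta)b_{t-1}$, with forecast $\hat{y}_{t+h} = l_t + h\,b_t$. The Python sketch below implements these textbook recursions; the series and smoothing constants are illustrative placeholders, not the article's data.

```python
# Minimal Holt (double) exponential smoothing: level + trend recursions.
def holt_forecast(y, alpha, beta, horizon):
    level, trend = y[0], y[1] - y[0]          # common initialization
    for t in range(1, len(y)):
        prev_level = level
        level = alpha * y[t] + (1 - alpha) * (level + trend)
        trend = beta * (level - prev_level) + (1 - beta) * trend
    # h-step-ahead forecasts extrapolate the last level and trend linearly.
    return [level + h * trend for h in range(1, horizon + 1)]

# Example: a short upward-trending series, forecast 3 steps ahead.
series = [100, 104, 109, 113, 118, 124]
print(holt_forecast(series, alpha=0.5, beta=0.3, horizon=3))
```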


Dr Mahdieh Bayati,
Volume 28, Issue 2 (3-2024)
Abstract

We live in the information age, constantly surrounded by vast amounts of data from the world around us. To utilize this information effectively, it must be expressed mathematically and analyzed using statistics.
Statistics plays a crucial role in many fields, including text mining, which has recently attracted significant attention. Text mining is a research method used to identify patterns in texts, which can be in written, spoken or visual form.
The applications of text mining are diverse and include text classification, clustering, web mining, sentiment analysis and more. Text mining techniques assign numerical values to textual data, enabling statistical analysis.
Since working with data requires a solid foundation in statistics, statistical tools are employed in text analysis to make predictions, such as forecasting changes in stock prices or currency exchange rates from current textual data.
By leveraging statistical methods, text mining can uncover, confirm or refute the truths hidden within textual content. Today, this topic is widely used in machine learning. This paper aims to provide a basic understanding of statistical tools in text mining and to demonstrate how these powerful tools can be used to analyze and interpret events.
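As a small illustration of the step described above, turning text into numbers that statistical tools can operate on, the following Python sketch vectorizes a few toy documents with TF-IDF weights; the corpus is invented and the choice of scikit-learn is ours, not the paper's.

```python
# Toy example: converting raw text into a numerical (TF-IDF) matrix
# that ordinary statistical methods can then analyze.
from sklearn.feature_extraction.text import TfidfVectorizer

docs = [
    "stock prices rose sharply today",
    "currency exchange rates fell today",
    "analysts predict stock prices will fall",
]

vectorizer = TfidfVectorizer()
X = vectorizer.fit_transform(docs)         # sparse documents-by-terms matrix

print(X.shape)                             # (3, number of distinct terms)
print(sorted(vectorizer.vocabulary_)[:5])  # a few of the learned terms
```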
Yasser Hashemzehi, Dr. Seyed Mahdi Amir Jahanshahi, Dr. Mohammad Hosein Dehqan,
Volume 28, Issue 2 (3-2024)
Abstract

In this article, we propose using the L2 divergence to test exponentiality for randomly censored data. We then compare, by Monte Carlo simulation, the ability of the proposed test to detect departures from the exponential distribution against other competing tests, including the Kolmogorov-Smirnov, Anderson-Darling and Cramér-von Mises tests, which are based on the empirical distribution function, as well as tests based on information criteria. The results of the simulation studies show that the proposed test generally performs better than its competitors.
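In its generic form (background; the censored-data version used in the article adapts this to estimated quantities), the L2 divergence between two densities is

$$D_{L_2}(f,g) = \int \bigl(f(x) - g(x)\bigr)^{2}\,dx,$$

which is zero exactly when the two densities coincide almost everywhere; a test of this type typically contrasts a nonparametric density estimate with the fitted exponential density.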


Somayeh Hutizadeh, Habib Naderi,
Volume 28, Issue 2 (3-2024)
Abstract

Drought is one of the most important concepts in hydrology, and it has gained increased significance in recent years; the results of its modeling and analysis are crucial for risk assessment and management. This study examines drought at the Zahedan station during the statistical period from 1951 to 2017 using the standardized precipitation index, and explains multivariate data modeling methods using vine copulas. Various models are compared using goodness-of-fit criteria, the best model is selected, and joint return periods are calculated and analyzed.
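For reference, the joint return period of two drought characteristics $X$ and $Y$ simultaneously exceeding thresholds $x$ and $y$ is commonly expressed through the fitted copula $C$ (a standard hydrological formula, not a result specific to this study):

$$T_{\text{and}}(x,y) = \frac{\mu}{\Pr(X > x,\, Y > y)} = \frac{\mu}{1 - u - v + C(u,v)},$$

where $u = F_X(x)$, $v = F_Y(y)$ and $\mu$ is the mean interarrival time between drought events.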
Ladan Faridi, Dr. Zahra Rezaei Ghahroodi,
Volume 28, Issue 2 (3-2024)
Abstract

Customer churn is one of the major economic concerns of many companies, including banks, which have focused their attention on customer retention because the cost of attracting a new customer is much higher than the cost of keeping an existing one. Customer churn prediction and profiling are therefore two major tasks for many companies.
Different learning approaches have been proposed; however, choosing a priori the most suitable model for both tasks remains non-trivial, as it depends strongly on the intrinsic characteristics of the churn data.
Our study compares several machine learning methods, combined with several resampling approaches for data balancing, on a public bank data set. Our evaluations, reported in terms of the area under the curve (AUC) and sensitivity, explore the influence of rebalancing strategies and of the different machine learning methods.
This work identifies the most appropriate methods in an attrition context and an effective pipeline based on an ensemble approach and clustering. Our strategy can enlighten marketing or human resources services on the behavioral patterns of customers and their attrition probability.
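A minimal sketch of the kind of pipeline described above (rebalance, fit an ensemble, score by AUC and sensitivity) is given below; it uses imbalanced-learn's SMOTE and a random forest on synthetic data as stand-ins, and should not be read as the authors' exact pipeline or data.

```python
# Illustrative churn-style pipeline: resample the minority class with SMOTE,
# fit an ensemble classifier, and evaluate with AUC and sensitivity (recall).
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import roc_auc_score, recall_score
from sklearn.model_selection import train_test_split
from imblearn.over_sampling import SMOTE

# Synthetic imbalanced data standing in for a bank churn data set.
X, y = make_classification(n_samples=2000, weights=[0.9, 0.1], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y,
                                                    random_state=0)

# Oversample the minority (churner) class in the training split only.
X_bal, y_bal = SMOTE(random_state=0).fit_resample(X_train, y_train)

clf = RandomForestClassifier(random_state=0).fit(X_bal, y_bal)
y_prob = clf.predict_proba(X_test)[:, 1]

print("AUC        :", roc_auc_score(y_test, y_prob))
print("sensitivity:", recall_score(y_test, clf.predict(X_test)))
```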
