Ulusal Tez Merkezi

Tez No	İndirme	Tez Künye	Durumu
201388		Yazılım projeleri ölçüm sonuçları veri tabanının oluşturulması ve yeni yazılım projelerinin maliyet tahmininde kullanimi / Construction of software projects measurement result database and usage in new software projects' cost estimations Yazar:MURAT AYYILDIZ Danışman: PROF. DR. OYA KALIPSIZ Yer Bilgisi: Yıldız Teknik Üniversitesi / Fen Bilimleri Enstitüsü / Bilgisayar Mühendisliği Ana Bilim Dalı Konu:Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol = Computer Engineering and Computer Science and Control Dizin:Maliyet tahmini = Cost estimation ; Yazılım mühendisliği = Software engineering	Onaylandı Doktora Türkçe 2007 124 s.

Bilgisayar yazılımlarında maliyet tahmini son zamanlarda oldukça önem kazanmıştır. Yazılımgeliştirme maliyeti her geçen gün artmaktadır. Yazılım maliyet tahmini hem devletler, hem deorganizasyonlar için çok önemli bir problemdir. Planlanan zaman ve bütçeyi aşan çok sayıdaproje mevcuttur. Bunun temelinde baştan bütçe ve zaman tahminini doğru yapamamaktankaynaklanan başarısızlıklar yatmaktadır.Yazılım geliştirme giderek pahallılaşmakta ve bilgi sistem bütçelerinin içinde büyük birmaliyet faktörü olmaktadır. Yazılım geliştirme maliyetleri ölçüm ve kestirimmetodolojilerinin yokluğundan dolayı sık sık kontrol dışına çıkmaktadır. Geçen on yıl içinde,bazı araştırmacılar maliyet tahminleme konusunda çalışmışlardır ancak sonuçlar tatmin ediciolmaktan uzaktır.Bu çalışmada yeni bir yazılım ölçüt kümesi oluşturma, oluşturulan yazılım ölçüt kümesi içinveri toplama ve bu veri kümesi ile bir yapay sinir ağı kullanılarak yazılım maliyet tahminimodeli geliştirme gerçekleştirilmiştir.Bu çalışmada, yapay sinir ağı temelli yazılım maliyet tahminleme uygulama sonuçları,başarısızlık nedenleri incelenmiş ve bir yapay sinir ağıyla yeni hazırladığımız ölçüt kümesinikullanarak bir model oluşturulmuştur. Yazılım maliyet tahminleme çalışmalarında ölçütkümesi seçiminin hayati bir rolü vardır. Özellikle yapay sinir ağı bazlı çalışmalarda ölçütkümesinin seçiminin önemi göz ardı edildiği görülmüştür. Çalışma sonucu oluşturulan modelile elde edilen sonuçlar, geleneksel ölçütler kullanarak yapılan önceki çalışma sonuçlarıylakarşılaştırılmıştır. Karşılaştırmayı yapabilmek için iki tip veri kullanılmıştır. Birinci kısımönceki çalışmalarda genellikle kullanılan yapıcı maliyet modeli verileri (COCOMO :Constructive Cost Model) ve ikinci kısım olarak yeni oluşturulan ölçüt kümesine uygunolarak Türkiye'de bulunan uluslararası bir firmadan toplanan veriler kullanılmıştır. Yazılımmaliyet tahminleme çalışmalarında olan bir diğer zorluk veri toplamanın zaman ve dikkatgerektirdiği gerçeğidir. Üstelik reel sektörde rekabet açısından dezavantaj oluşturma olasılığınedeniyle, birçok kurum bu konuda topladığı verileri eğitim amaçlı da olsa kullandırmayıuygun görmemektedirler.Literaturde yaptığımız incelemede yapay sinir ağı kullanarak yapılan maliyet tahminlemeyöntemleri çalışmalarında genellikle MLP kullanıldığı saptanmıştır. Bu tez çalışmasındaYapay sinir ağı kullanılarak yapılan yazılım maliyet tahmini modeli oluşturmak için MLP(Multi Layer Perseptron) ve bu alanda kullanılmayan Elman yapay sinir ağı modeliuygulanmıştır. Kıyaslamayı sağlayabilmek için MLP ve Elman modellerinin ikisindekullanımında COCOMO 81 ve yeni oluşturulan ölçüt kümesi YEEM (Yıldız EffortEstimation Metrics) kullanılmıştır. Hatasız karşılaştırabilmek için veri kümelerinde eşitsayıda örnek ile test edilmiştir. Daha iyi araştırma adına yeni oluşturulan veri kümesi içindaha büyük bir küme ile de çalışmalar yapılmıştır. Ölçüt kümelerinin karakteristiği ve verimiktarı bu çalışmanın incelediği konulardan biridir. YEEM ölçüt kümesi yapay sinir ağıtopolojisini oluşturmakta kullanılmıştır. YEEM için toplanan veriler ile MLP ve Elman yapaysinir ağları kullanılarak ağ eğitilmiş ve test edilmiştir. Toplanan verilerin daha bütünselkullanımı için çapraz onaylama metodu kullanılmıştır. Toplanan verinin sınırlı sayıda olmasınedeniyle %5,%10 ve %15'lik çapraz onaylama teknikleri uygulanmıştır.Doğru ve nitelikli bir ölçüt kümesi kullanıldığı sürece yapay sinir ağları yazılım maliyettahminleme çalışmalarında başarıyla kullanılabileceği görülmüştür.

Cost estimation of computer software is getting more important. Software developmentbecomes increasingly expensive. Software cost estimation is a very important problem forgovernments and organizations. There are a lot of projects which exceeded planned budgedand time. Incorrect budget and time planning at the beginning is the main failure reason.Software becomes increasingly expensive to develop and is a major cost factor in anyinformation system budget. Software development costs often get out of control due to lack ofmeasurement and estimation methodologies. During last decade, some researches on costestimation have been conducted. In the search of new methodologies to estimate the softwaredevelopment costs but the results were far from being satisfying.In this study, a new software engineering metric set was developed and the data was collectedaccording to new metric set and a new software cost estimation model was developed byusing neural network.In this study we have explored the reasons of the disappointing results of the existing softwarecost estimation with neural network studies and implemented different neural network modelsusing augmented new metrics. The metric-set selection has a vital role in software costestimation studies; its importance has been ignored especially in neural network based studies.The results obtained are compared with previous studies using traditional metrics. To be ableto make comparisons, two types of data have been used. The first part of the data is takenfrom the Constructive Cost Model which is commonly used in previous studies and thesecond part is collected according to new metrics in a leading international company inTurkey. Another difficulty associated with the cost estimation studies is the fact that the datacollection requires time and care. Futhermore, many companies do not share their databecause of competition disadvantage possibilities.In recent literature, we have explored that MLP had been used in software cost estimationwith neural network studies. In this study, software cost estimation by using neural networkmodels presented here are based on Multi-Layer Perceptron (MLP) and Elman neuralnetworks which has not been used for this aim. The results obtained are compared withprevious studies using traditional metrics. The models presented here are based on Multi-Layer Perceptron and Elman Networks for both COCOMO?81 metric set and for theaugmented metric set (YEEM : Yıldız Effort Estimation Metrics). To be able to compare theresults accurately tests were run for datasets containing the same number of samples. Toinvestigate further, we have also experimented with larger datasets formed according toaugmented metrics. Addressing the issues of the dataset characteristics and the amount ofsamples in the datasets is one of the purposes of this research. YEEM has been used toconstruct the neural network topology. The MLP and Elman neural networks has beentrainned and tested by using data for YEEM. To make a more thorough use of the samplescollected, k-fold, cross validation method is also implemented. Since the amount of sampleswe have collected are still limited, 5-fold, 10 fold and 15-fold cross validation techniqueshave been also applied.It is concluded that, as long as an accurate and quantifiable set of metrics are defined andmeasured correctly, neural networks can be applied in software cost estimation studies withsuccess.