Browsing by Title

Now showing items 3062-3081 of 4026

Sampling based Frequency Estimation on Massive Wikipedia JSON Documents

Fang, Shuqing (2017)

Big data is now being utilized widely and developed rapidly. The researches on big data area is meaningful as it provides all kind of information. Answering aggregation queries are also very important in both research and commercial fields. In this paper we aim to introduce a sampling method to answer aggregation queries on realistic massive data with controlled relative error bound. We used JSON as the experiment material data which makes it different from the related and existed researches. Wikipeida records are stored as big JSON data provides the realistic data environment which makes the results meaningful and trustworthy. We utilize the Wikipedia big JSON file, process data, modify and adapt the sampling algorithm with given relative error bound. Specifically, preliminary process of big JSON file and implement retrieving interested attributes and store the filtered attributes for sampling use. Then modify the dividing buckets algorithm to divide data into buckets by their similarity and the weight of data group. Then answer the aggregation queries on the sampled data. We analyze the experiment results with the error bound, confidence and running time and the relations of error bound and sample sizes. We expect the results of error bound is under what have given so the results are reliable and the sampling method to dramatically reduce the running time and space for answering aggregation queries.
Sanastonoppimispalvelun käytettävyyden arviointi : systemaattinen kirjallisuuskatsaus

Salento, Tea (2017)

Oppimispalveluiden arvioinnista on vähän kokoavaa ja riittävän kuvailevaa tutkimusta, joten ei ole nopeaa vastausta siihen, kuinka tietynlaisen oppimispalvelun käytettävyyttä voitaisiin arvioida. Tämän tutkimuksen tavoitteena on selvittää, millä tavoin sanastonoppimista tukevan ja sanakirjan sisältävän Sanakirja.fi-palvelun käytettävyyttä voitaisiin arvioida. Menetelmänä käytettiin systemaattista kirjallisuuskatsausta, jonka avulla etsittiin esimerkkejä vastaavanlaisten sanastonoppimispalveluiden tai sähköisten sanakirjojen käytettävyyden arvioinnista. Tulosten mukaan sanastonoppimispalveluita on arvioitu kyselyllä, käytettävyystestillä, haastattelulla, kentällä havainnoimalla ja muilla tavoin. Kyselyistä löytyi esimerkkejä itse kehitetyistä kysymysjoukoista, mutta myös yhtä tunnettua ja yleiseksi tarkoitettua kyselyä käytettiin. Käytettävyystestin käytöstä sanakirjan arviointiin löydettiin yksityiskohtainen kuvaus. Muista tavoista löydettiin vain vähän tietoa. Tulokset voivat toimia esimerkkeinä myös muiden sanastonoppimispalveluiden käytettävyyden arviointiin.
Sand and Dust Storm (SDS) Episodes Classification in the Eastern Mediterranean Region

Al Dulaimi, Qusay (2020)

Sand and dust storms are one of the major regional environmental problems that affect human health. Many environmental studies have focused on airborne dust concentrations observed at different regions and have tried to connect the observations to specific dust source regions. This thesis aims to provide a new dust classifications scheme for the Eastern Mediterranean region, specifically observed in Amman, Jordan. I utilized a combination of a long-term data-base consisting of aerosol particle number concentration in coarse mode (1–10 µm) during November 2013 – July 2018 and air mass back trajectories analysis to visually identify the Sand and Dust Storm (SDS) episodes. The classification included three main source regions of for the submicron dust, namely Sahara, Arabia, and Levant. I also classified the data according to the, episode intensity according to their corresponding number concentrations as no-dust, mild, intermediate, and strong intensities and further classified the range of back trajectories as short, intermediate, long, and very long, which indicates the distance between the observation site and the source region.. The results showed that majority of the dust events and an elevated number of dust days are influenced by a source in Levant and Sahara source region. These events which dominated during 70 days in 2016. The Levant source governed during 60 days during the same period. Other dust sources contributed less to the dusty days, and the lowest dusty days number was due to emissions from Levant & Arabia (19 days). The episode intensity varied censurably and underlined variability from the different source areas. The maximum intensity in the dust episode concentration was linked to Levant & Sahara with a max number concentration of 95 /cm3. The classification method was successful and it was able to establish a dust source database in the Eastern Mediterranean region based on the long-term observations performed in Amman with variable dust concentration and dust periods in different seasons and different meteorological circumstances.
SAP ERP -toiminnanohjausjärjestelmien automatisoitu regressiotestaus

Rytkönen, Riku (2016)

SAP ERP -toiminnanohjausjärjestelmä koostuu yrityksen erillisistä liiketoiminnan moduuleista. Moduuleihin kohdistuu esimerkiksi erilaisia muutospyyntöjä, jotka vaativat kehittäjän tekemään muutoksia ohjelmistokoodiin. Testauksella ja laadunvarmistuksella pyritään varmistamaan järjestelmään kohdistuneet muutokset ennen kuin ne liitetään osaksi ohjelmistokoodia. Regressiotestauksella tarkoitetaan toistuvaa testausta silloin kun järjestelmä tai järjestelmän jokin osa on muuttunut. Manuaalisella ja automatisoidulla regressiotestauksella voidaan havaita mahdolliset virheet, jotka syntyvät muutosten myötä. Tämän tutkielman tarkoituksena on selvittää, miksi kaksi kohdeyritystä eivät käytä SAP ERP -toiminnanohjausjärjestelmän automatisoitua regressiotestausta, vaikka useiden eri tieteellisten tutkimusten mukaan sillä saavutetaan kustannussäästöjä sekä testauksen läpivientiajan lyhenemistä. Tarkoituksena on myös havainnollistaa kohdeyritysten testaus- ja laadunvarmistusprosesseja sekä selvittää, kuinka kohdeyritykset suorittavat SAP ERP -toiminnanohjausjärjestelmän regressiotestausta. Tutkielmassa havaitaan, että molemmat kohdeyritykset ovat valmiita tulevaisuudessa hyödyntämään SAP ERP -toiminnanohjausjärjestelmän automaattista regressiotestausta. Kohdeyritykset näkevät kuitenkin vaatimuksena automaation käyttöönotolle kustannusarviot, jotka on laskettu yritysten omia liiketoimintaprosesseja käyttäen. Tutkimuksessa myös havaitaan, että SAP ERP -toiminnanohjausjärjestelmän testiautomaation käyttöönotto vaatii kuitenkin kohdeyrityksiltä toimivan muutostenhallintaprosessin.
Sarakeorientoitunut keskusmuistitietokanta tietokantajärjestelmän osana

Kärki, Arto (2021)

Perinteisten levyperustaisten relaatiotietokantojen taulujen sisältämä tieto talletetaan riveittäin peräkkäistiedostoihin ja rivi koostuu yleensä useista eri sarakkeista. Tällöin tietokantaan kohdistettava yleensä SQL-kielinen kysely saa haettua yhden rivin tiedot nopeasti, varsinkin jos kyseinen rivi sattuu olemaan jo valmiiksi muistissa. Jos kyselyllä haetaankin tietoja useilta sadoilta, ellei jopa tuhansilta riveiltä ja kyselyyn sisältyy hakuehto, jossa voi olla mukana useita sarakkeita, niin kyselystä tulee yleensä aina hidas, elleivät rivit sitten sattumalta ole järjestetty hakutekijän mukaan. Lisäksi jos haettavat rivit eivät ole valmiina muistissa, niin järjestelmä joutuu tekemään hitaan tiedon haun levyltä ja tämä hidastaa entisestään halutun tulosjoukon muodostamista. Sarakeorientoituneessa tietokannassa kunkin taulun sarakkeen tieto voidaan tallettaa omaan tiedostoonsa, tai sitten kunkin sarakkeen tiedot sijaitsevat peräkkäin samassa tiedostossa. Kunkin sarakkeen tiedot voidaan myös tiivistää ja säästää näin tilaa. Sarakeorientoituneesta taulusta tietyn rajatun osajoukon hakeminen on usein nopeampaa, koska kysely voidaan kohdistaa vain haluttuihin sarakkeisiin ja koska sarakkeet arvot ovat peräkkäin, ovat tiedot näin nopeasti saatavilla. Jos tiedot eivät ole jo valmiiksi muistissa, niin myös tiedon hakeminen ulkoiselta tiedostolta nopeutuu, koska halutut tiedot ovat peräkkäin levyllä. Kun sarakeorientoitunut tietokanta sijoitetaan keskusmuistiin, niin myös mahdollinen levyltä lukemiseen kuluva viive poistuu, ja tulosten saanti nopeutuu entisestään. Tutkielmassa tarkastellaan sarakeorientoituneen keskusmuistitietokannan toimintaa ja sen liittämistä osaksi muuta tietokantajärjestelmää. ACM Computing Classification System (CCS): Information systems → Main memory engines Information systems → Data compression Information systems → Database query processing
Sarasuo-rahkasuovaihettuman kasvillisuushistoria neljällä boreaalisella rahkasuolla : ajallinen ja alueellinen näkökulma

Mellais, Annina (2013)

The basis of this study was to examine the variation in the history of mire vegetation and to use the results to predict how mires might develop in the future. In the Finnish conditions, the northernmost mires are typically minerotrophic fens and, further south, the most common ones are ombrotrophic bogs. The borderline between fen and bog zones follows the climatic factors and may therefore be sensitive to changes in these factors. Climate change could potentially trigger ombrotrophication i.e. the development from fen to bog. The purpose of this study is to determine when, why, and how quickly the studied mires have developed from fen to bog, and which species have played a key role in the ombrotrophication. A further aim of this study is to examine whether the obmrotrophication has been driven by the autogenic or allogenic processes and how the location of the mire has affected the ombrotophication. The study examined the mire development from the beginning of the mire formation until the bog phase. The research methods used in this study were macrofossil analysis and radiocarbon dating. Altogether, 200 individual samples were studied. The research material was also approached through multivariate analyses. Detrended correspondence analysis (DCA) was used to find out differences between plant species and distribution of the data. Variation partitioning was used to find out how peat thickness, fires and autogenic development effect the variation of the mire vegetation. The development of each of the four bogs is similar: the development starts from the fen phase from which it changes to cotton grass (Eriophorum vaginatum) dominated phase and finally to Sphagnum dominated bog. The most important key species are cotton grass and ombrotrophic Sphagnum mosses. Fires seem to have triggered ombrotrophication of the Siikaneva and Lyali bogs. The Honkaneva bog is located in a land uplift area which makes its development unique. In all the studied sites, ombrotophication took place rather slowly over thousands of years, but there have also been some rapid changes especially during fires. The further south the mire is located, the earlier the ombrotrophication has started. The fen-bog transition turned out to be more complex than what has been thought, and, therefore, it was divided into two phases: the Carex-Eriophorum transition and the Eriophorum-Sphagnum transition. Changes in the vegetation could not be explained only on the basis of a single factor. The autogenic factors, the climate and the disturbances all seem to have affected the vegetation. The most important factor was estimated to be the autogenic development. Climate change can either speed up or slow down the ombrotrophication process and it can happen through a dry or wet phase.
Satamasta urbaaniksi kantakaupungiksi : Alueen käyttötarkoituksen muutos Länsisataman alueella

Tammilehto, Stefan (2014)

Port cities have through times been in the forefront of cultural, social and technological changes. The cityspace of Helsinki is in this sense also a target of consistant change. One of the biggest challenges has been the relocation of the harbour to Vuosaari, which has opened the Western Harbour for redevelopment. Western Harbour and particularly Jätkäsaari, which is now under redevelopment, is in the focus of this thesis. Especially the development and evolution from a harbour area to an urban part of inner-city is being investigated. How does the general historical development of harbours appear like? Harbours have elsewhere in the world became underused areas in central city locations, since they were for a long time physically, socially and economically isolated from the rest of the city structure. What is the restructuring of Western Harbour like - the phases of the process and the reasons for the change of use? The redevelopment of central harbour areas and industrial waterfronts forms a unique opportunity, but their legacy often consists of a polluted environment that is in a comprehensive need of renovation. This port-city interface is a zone of contested and overlapping challenges, where modern city planning meets the area of port operations. What are the prerequisites of good urban fabric and design in the redevelopment of an old harbour area? This thesis forms a qualitative and comparative case study. Brian Hoyle's Port-city evolution and interface theories are used to illustrate the temporal developments and changes in port cities. Yehuda Hayuth's model about Trends and developments on the port-city interface complements Hoyles model. The redevelopment of Western Harbour and Jätkäsaari is also compared to HafenCity Hamburg. With the help of Rinio Bruttomesso and Michael Clark we form an image about the conditions of good urban fabric in the former harbour areas and of what that is required from them to ensure a more effective and successful entity. The global and historical development of port cities complies well with Hoyles Port-city theory. Port cities were a single unit at the beginning of the evolution, but became separated over the years, as ports moved to more suitable locations (due to increasing vessel sizes, need of space). Hence, the regeneration and reintegration (besides physically, also culturally and spiritually) of former harbour areas has becomed the challenge. The developments that are demonstrated through Hoyles theory are caused by technological, social and economic changes. The introduction of steamships and containers form the two most important singular contributing causes, that have controlled the development of harbours. The Port-city theory describes well the development of Western Harbour, the final stages take place here relatively late, but in a faster pace than elsewhere. The RAMA-survey, the citys growth and development needs, as well as the location of the port constituted problems that worked as an initiation for the transformation process of Western Harbour. The redevelopment constitutes a challenge, where the ability to solve intersecting goals and purposes, environmental problems and the aim of creating a good city structure is at the core. Good urban design in the former port areas is based on determining the nature, range of functions and features as well as their diversity and the blending of them in the area. Also the utilization of old port structures - houses and elements together with the purity of the surrounding waters constitute contributing themes. A number of solutions related to the organizing of the cityspace, like opening up the waterfront and the ritualization of it, along with the chance to create a pedestrian- and public transit-city and build innovative and bold architecture are all central themes that have a chance of being realized on the waterfront. The construction of Western Harbour gives Helsinki a new maritime image and renews the city in its original core. It is an inner-city expansion, which has a large impact on the image of southern Helsinki - its city structure and city life. The objective is to build a city for the unknown future, which is both a challenge and an opportunity.
Sateen intensiteetti Suomen kesäsateissa säätutkamittausten mukaan

Inkinen, Mikko (Helsingin yliopistoUniversity of HelsinkiHelsingfors universitet, 2003)

Sateen intensiteettiä (R) oli ennen tätä työtä tutkittu Suomessa pääasiassa maanpinnalla tehdyistä sademittarimittauksista. Tässä työssä hyödynnettiin säätutkan ylivoimaista ajallista ja alueellista resoluutiota havaita sadetta sademittareihin verrattuna, vertailtiin säätutkan ja sademittarin mittaamaa sateen intensiteettiä ja muodostettiin valtavasta havaintoaineistosta sateen intensiteetin todennäköisyysjakaumia, joilla on sovellusarvoa mm. mitoitettaessa kaupunkien viemäriverkostoja. Teoreettisena sateen intensiteetin todennäköisyysjakaumana käytettiin lognormaalijakaumaa. Säätutka-sademittarivertailussa havaintoaineistona oli noin 60000 Vaisalan FD12P:llä mitattua havaintoa kymmenen minuutin sateen intensiteetistä. Säätutkamittauksia Ilmatieteen laitoksen säätutkilta oli kaikkiaan noin 10 miljardia Suomen maa-alueilta Lappia lukuun ottamatta, joista saderajaksi valittu 10 dBZ-yksikköä ylittyi noin 6,7 % havainnoista. Havaintoaineistot oli kerätty kesä-, heinä- ja elokuilta 2000-2002. Säätutka mittaa tutkaheijastavuustekijää (Z) vaikutustilavuudesta, joka kasvaa ja nousee ylemmäs maanpinnasta mitä kauempana tutkasta mittaus tehdään. Vaikutustilavuudessa voi olla vesipisaroiden lisäksi mm. lumihiutaleita, rakeita, lintuja, hyönteisiä ja korkeita rakennuksia. Kun otetaan aineistoa vain 40-100 km:n etäisyydeltä tutkista ja tehdään siihen raekorjaus, on saaduissa Z:n todennäköisyysjakaumissa lähinnä vesipisaroista saatuja mittausarvoja. Näin voidaan käyttää R(Z)-muunnosta Z=250R1,5 ja saada hetkellisen aluesadannan todennäköisyyksiä 0-1,2 km2:n kokoisille alueille. Tällaiset hetkellisen aluesadannan todennäköisyysjakaumat voidaan samaistaa 0-2,4 minuutin pistesadannan mittauksiksi, kun oletetaan, että satava alue liikkuu keskimäärin 10 m/s. Saatujen tulosten mukaan 1,5 minuutin pistesadannan intensiteetti ylittää yksittäisessä havaintopaikassa Suomen maa-alueilla kerran vuodessa noin 90 mm/h, kerran sadassa vuodessa noin 400 mm/h ja kerran 10000 vuodessa noin 1200-1600 mm/h.
Säteilypakote-käsitteen oppiminen Ilmasto.nyt-kurssilla

Martikainen, Jyrki (2019)

Tässä tutkielmassa tutkittiin säteilypakote-käsitteen oppimista Ilmasto.nyt-kurssilla ja kurssin taustamuuttujien vaikutusta kurssin suorittamiseen. Kurssi on monitieteinen ja se toteutettiin monimuoto-opetuksena. Ilmasto.nyt-kurssille osallistui monella tapaa heterogeeninen opiskelijajoukko. Kurssin suoritti vuosina 2016 ja 2017, 172 opiskelijaa, joista 38 otettiin mukaan tähän tutkimukseen. Kurssille osallistuneita tutkittiin sukupuolen, opiskeluvuoden, tiedekunnan, vastausten pituuden ja suorituskielen mukaan. Tutkimuksessa havaittiin että 5.-6. vuoden opiskelijat ja äidinkielellä vastanneet saivat muita parempia arvosanoja. Pidemmät vastaukset paransivat arvosanoja keskimäärin. Opiskelijat valitsivat kysymyspatterista vastattavaksi erilaisia kysymyksiä riippuen opiskelijan tiedekunnasta. Kurssia arvioitiin haastavaksi, mutta hyödylliseksi ja kattavaksi. Tutkimuksessa käytettiin tilastollisia menetelmiä, eri taustamuuttujien ja arvosanojen välisten riippuvuuksien selvittämiseksi.
Säteilysumun tunnistaminen ja analysointi Kivenlahden maston havaintojen avulla

Isolähteenmäki, Pia (2019)

Tämän pro gradu -tutkielman tavoitteena on selvittää onko säteilysumutilanteiden tunnistaminen mahdollista maston eri korkeuksilta saatavien lämpötila- ja kosteushavaintojen avulla ja tarkastella mahdollisten tunnistettujen säteilysumutapausten elinkaarta. Tutkielmassa käytettävä aineisto koostuu Espoon Kivenlahdessa sijaitsevan tv-maston vuosien 2014-2018 havainnoista ja säteilysumutilanteen tunnistamisen onnistumista arvioidaan Espoon Tapiolan ja Nuuksion sekä Helsinki-Vantaan pintasääasemien näkyvyysmittausten perusteella. Tutkimuksessa valitaan maston kahden alimman mittaustason (2m ja 26m) lämpötila- ja kosteushavainnoille suodatuskriteerit, joiden perusteella löydetään 126 mastohavaintojen nojalla säteilysumuksi tulkittavaa tapausta. Näistä tapauksista 38 täyttää alle 1km näkyvyysehdon jollain pintasääasemalla. Koska pintasääasemat sijaitsevat vaihtelevien etäisyyksien päässä mastosta ja pintasääasemien olosuhteissa on paikallista vaihtelua, työssä analysoidaan näkyvyyssuodatettujen tapausten lisäksi myös kaikkien kriteerit täyttävien tapausten elinkaarta. Aineistoa analysoidaan aritmeettisen keskiarvon ja keskihajonnan avulla. Havaintojen nojalla määriteltyjä säteilysumutapauksia edeltävät meteorologiset olosuhteet ovat samansuuntaiset muualla saatujen aiempien tutkimustulosten kanssa. Ennen sumun muodostumista tuulen nopeus heikkenee, lämpötila laskee ja suhteellinen kosteus kasvaa. Tutkimuksen säteilysumutilanteet jaetaan kasvukorkeuden perusteella paksuihin (korkeus ≥ 26m ) sekä ohuisiin (2m ≤ korkeus < 26m) tapauksiin. Paksuja säteilysumutapauksia löytyy kaikkien (näkyvyyssuodatettujen) tapausten joukosta 58 (25) ja ohuita tapauksia 56 (13). Keskimääräinen tuulen suunta on sekä ohuiden että paksujen tapausten osalta etelästä, joten tuulen suunta ei anna suoraa selitystä säteilysumujen kasvukorkeudelle. Ylempien tasojen kosteusolosuhteet sen sijaan antavat viitteitä sumun kasvukorkeuden kehittymisestä suhteellisen kosteuden ollessa keskimäärin 10 prosenttiyksikköä korkeampi paksujen tapausten kuin ohuiden tapausten kohdalla ylemmillä mittaustasoilla. Maston havaintohistorian (1989-2018) kaikki mittaukset eivät ole tutkimuksen kannalta käyttökelpoisia johtuen muun muassa mittauskorkeuksien vaihtumisesta havaintohistorian aikana. Aineiston havaintojen keskihajonta oli siten kautta linjan melko suuri suppeasta otoskoosta johtuen. Vahvempien johtopäätösten tekemiseksi vaadittaisiin enemmän havaintoja. Lisäksi Kivenlahden mastoon asennettavasta näkyvysmittarista olisi suuri apu sumutilanteiden tarkemman tunnistamisen mahdollistamiseksi, sillä toisaiseksi vaihtelevien etäisyyksien päässä sijaitsevat näkyvyysmittarit eivät anna tarkkaa kuvaa Kivenlahden näkyvyysolosuhteista.
Satelliitti-instrumentti IASI ja sen sovellukset sääpalvelulle

Perttula, Tuuli (Helsingin yliopistoHelsingfors universitetUniversity of Helsinki, 2010)

IASI on vuodesta 2007 käytössä ollut satelliitti-instrumentti Metop-polaarisatelliitissa. IASI-mittauksista johdettuja lopputuotteita ovat mm. lämpötilan ja kosteuden pystyprofiilit, pilven ylärajan lämpötila ja paine sekä eri hivenkaasujen pitoisuudet. Tämä työ on alkua IASI:n käyttöönotolle Ilmatieteen laitoksella. Työssä selvitetään IASI:n lämpötilaprofiileiden ja pilven ylärajan paineen soveltuvuutta sääpalvelulle vertailemalla niitä jo käytössä oleviin sääpalvelun työkaluihin. IASI-mittauksista johdettuja lämpötilaprofiileita verrataan Ilmatieteen laitoksen operatiivisiin pintaluotauksiin. Lisäksi tarkastellaan IASI:n lämpötilaprofiileiden vaikutusta paikallisen analyysi- ja ennustustyökalu LAPS:in lämpötila-analyysiin. IASI:n pilven ylärajan lämpötiloja verrataan AVHRR-radiometrista johdettuihin pilven ylärajan lämpötiloihin. IASI:sta ja AVHRR:stä johdetut keskimääräiset lämpötilat olivat lähes samat alapilville. Yläpilville ja osittain läpinäkyville cirrus-pilville lämpötilaero oli noin 5 Celsius-astetta. IASI:n lämpötilaluotaukset osoittautuivat käyttökelpoisiksi etenkin keski- ja ylätroposfäärissä (350 - 600 hPa), jossa IASI:n antama lämpötila erosi pintaluotauksen lämpötilasta vain noin ±1 Celsius-astetta. IASI:n lämpötilaluotausten suurin ongelma on mittausten katkeaminen pilven ylärajaan. IASI-luotauksilla oli suuri vaikutus LAPS-lämpötila-analyysiin mallin koko alueella, mutta vertailuaineiston puutteessa ei voida varmasti sanoa onko vaikutus positiivinen vai negatiivinen. Tulokset ovat lupaavia. IASI:n lämpötilaluotaukset ja pilven ylärajan lämpötila vaikuttavat käyttökelpoisilta sääpalvelun tarpeisiin.
Satunnaismatriisien ominaisarvoista ja niiden sovelluksista

Pylvänäinen, Annika (2012)

Tämän Pro gradun aiheena on satunnaismatriisien ominaisarvojen jakautuminen ja jakauman soveltaminen. Keskitytään erityisesti gaussisiin matriisiensembleihin, toisin sanoen matriisikokoelmiin, joiden alkiot noudattavat normaalijakaumaa. Tämän jakaumaoletuksen pätiessä teoriaa sovelletaan dffuusio-MRI tutkimukseen. Ensimmäisessä luvussa tarkastellaan matriisien ominaisuuksia, jotka ovat keskeisessä roolissa satunnaismatriisien teoriassa. Määritellään neliömatriisin ominaisuuksia kuten matriisin neliömuoto ja determinantti. Määritellään lisäksi matriisiarvoinen satunnaismuuttuja ja sen seurauksena keskitytään satunnaismatriiseihin. Todennäköisyys on keskeinen työkalu satunnaisuutta käsiteltäessä ja määritelläänkin todennäköisyysteorian peruselementtejä. Niiden avulla voidaan laskea satunnaismatriisin multinormaalijakauma sekä sen ominaisarvot ja -vektorit. Luvussa 2 määritellään Wignerin reaalinen symmetrinen- ja Wignerin hermiittinen matriisi. Perehdytään ennen kaikkea gaussiseen ortogonaaliseen (GOE)- ja gaussiseen unitaariseen matriisiensembleen (GUE), jotka ovat Wignerin matriisien erikoistapauksia. Tarkastellaan gaussisten matriisien jakaumaa ja erityisesti lasketaan matriisin ominaisarvojen jakauma. Se on tämän Pro Gradun keskeisempiä tuloksia ja sitä voidaan luvussa 3 soveltaa myös magneettikuvauksen teoriaan. Määritetään lisäksi Mehtan ja Selbergin integraalit, joiden avulla voidaan määrittää jakauman normalisointivakio. Lopuksi tarkastellaan diffuusiotensori- ja diffuusiopainotteista magneettikuvausta. Kuvataan ensin veden di_uusiota toisen asteen tensoreiden ja diffuusiofunktion avulla. Tämä on kolmiulotteinen malli, joka kuvaa diffuusion suuntaa kudoksessa. Monimutkaisempien diffuusioprofiilien, kuten kudosten hienorakenteiden sekä kuitujen leikkauskohtien tarkastelemiseen tarvitaan korkeamman asteen tensoreita. Tutustutaan niiden käyttöön sekä käytön vaatimiin rajoituksiin. Tarkastellaan sekä vektorin että tensorin jakaumia. Määritellään lisäksi rajoitteet, jotka vaaditaan algebrallisten ja geometristen ominaisuuksien säilymiseen muuntautuessa vektori- ja tenroriarvoisten muuttujien välillä. Lasketaan myös jakauman normalisointivakio. Lopuksi tarkastellaan isotrooppisen tensorin ominaisarvojen jakaumaa.
Satunnaismetsä-koneoppimismenetelmä, teoria ja soveltaminen

Telivuo, Suvi (2018)

Taloudellisten, institutionaalisten ja teknologisten ympäristöjen kiihtyvä muutostahti on luonut tarpeen tehdä oikeita valintoja menestymisen ja kehityksen takeeksi. Parhaimman valinnan tekee henkilö, jolla on eniten tietoa ja varmuutta tiedon paikkansapitävyydestä. Vaihtoehtoisesti päätöksenteon epävarmuustekijöitä voidaan hallita eliminointimenetelmillä, joiden hyödyntäminen voi myös johtaa parempiin päätöksiin. Epävarmuuden minimoimisen edellytyksenä on niin ikään pohjatietojen parantaminen. Tästä tarpeesta ovat nousseet tiedonlouhintamenetelmät. Tiedonlouhintamenetelmiä on kehittynyt valtava määrä vastaamaan kysynnän luomia tarpeita. Päätöksentekopuu on eräs tällainen analysointimenetelmä ja päätöksentekopuun pohjalta on luotu koneoppimismenetelmä satunnaismetsä. Satunnaismetsä on tutkimusten mukaan tällä hetkellä paras saatavilla oleva luokittelumenetelmä ja valikoitunut tämän tutkielman aiheeksi. Luvussa 2 luomme pohjaa satunnaismetsä-menetelmän ymmärtämiseksi. Lähdemme liikkeelle koneoppimisesta ja datalouhintamenetelmistä, joilla alustamme päätöksentekopuutyökalun. Käy ilmi, että on olemassa luokittelu-, regressio- ja luokitteluregressiopuita, ja että tässä tutkielmassa keskitymme luokittelupuihin. Tämän jälkeen esittelemme päätöksentekopuun metodologiaa. Luvussa 3 esittelemme tutkielman kannalta päätöksentekopuiden tärkeimmät validointimenetelmät, sillä datalouhinnassa analysointimenetelmien validoiminen on yhtä tärkeää kuin itse analysoiminen. Esittelemme mallien validointiin liittyviä käsitteitä, kuten tarkkuus, yleistysvirhe ja ylisovittuminen. Käymme läpi yleisimpiä tapoja validoida malleja, sekä näytämme esimerkkien kautta työkalut, joita käytämme tutkielmassa. Näitä ovat tarkkuus, väärinluokittelumatriisi, ROC-, kumulatiivinen saanti- ja nostokäyrä. Luvussa 4 esittelemme satunnaismetsä-koneoppimismenetelmän. Käymme ensiksi läpi joukko-oppimisen metodologiaa, jonka jälkeen käsittelemme satunnaismetsän algoritmin. Osoitamme teoreettisesti, miksi satunnaismetsä on parempi luokittelija kuin esimerkiksi päätöksentekopuu näyttämällä, että satunnaismetsän puumäärän kasvaessa satunnaismetsän yleistysvirhe suppenee kohti nollaa. Analysoimme teoreettisesti satunnaismetsän hyötyjä ja haittoja. Satunnaismetsän hyötyjä ovat sen tarkkuus, nopeus, ymmärrettävyys, toimivuus valtavilla datamäärillä, sekä kykeneväisyys analysoida tietojoukon merkittävimpiä muuttujia. Haittoja ovat, että satunnaismetsä ei suoriudu yhtä hyvin regressio-ongelmissa kuin esimerkiksi logistinen regressiomalli, sekä huono sovellettavuus pieniin tietojoukkoihin. Luvussa 5 sovellamme opittuja taitoja suppeaan tietojoukkoon. Tarkoituksenamme on arvioida RStudion ja SAS Enterprise Minerin satunnaismetsä-pakettien toimivuutta tunnetulla syötejoukolla. Analysoimme satunnaismetsäin suoriutumista luokittelutehtävässä ja vertailemme tuloksia päätöksentekopuuhun ja regressiomalliin. Hyödynnämme luvussa 3 opittuja validointimenetelmiä. Käy ilmi, että RStudion ja SAS Enterprise Minerin satunnaismetsä-paketit toimivat hyvin, ja että satunnaismetsä suoriutuu pienenkin tietojoukon luokittelussa malleista parhaiten. Luvussa 6 sovellamme satunnaismetsää yrityksen tarjoamaan haasteeseen, jossa tarkoitus on selittää ja ennustaa uusien asiakkaiden tietyn asiakassegmentin asiakasvaihtuvuutta. Käytämme yhtiön tarjoamia tietokantoja ja SAS Enterprise Miner-työkalua. Suoritamme vertailun satunnaismetsä- ja päätöksentekopuu-mallien välillä käyttämällä luvussa 3 esiteltyjä validointimenetelmiä ja analysoimme tulokset. Käy ilmi, että ROC-käyrien ja tarkkuuden perusteella satunnaismetsä suoriutuu sekä luokittelussa että ennustamisessa paremmin kuin päätöksentekopuu. Luvussa 7 pohdimme, millaisissa puitteissa satunnaismetsä soveltuu yrityksen liiketoimintaprosessiin. Käymme läpi vaatimuksia, joita satunnaismetsän soveltaminen asettaa, sekä mitä lisäarvoa satunnaismetsä menetelmänä tuo yritykselle. Tulos on, että satunnaismetsä soveltuu hyvin yrityksille, jotka hyödyntävät SAS-työkaluja ja tuo lisäarvoa analysointitehtäviin olemalla ymmärrettävä malli, mutta kuitenkin monipuolinen, nopea ja tarkka.
Savipartikkelien pinnan ATRP-oksastus tuoksumolekyylien absorptio-ominaisuuksien parantamiseksi

Ahola, Johanna (2017)

Tässä työssä tutkittiin poly(2-(dimetyyliamino)etyyli metakrylaatin) (PDMAEMA) oksastamista eri savipartikkelien pintaan kontrolloidulla radikaalipolymerointimenetelmällä ja sen vaikutusta partikkelien kykyyn absorboida tuoksumolekyylejä (tässä appelsiiniöljy) verrattuna puhtaisiin saviin. Lisäksi pyrittiin selvittämään, miten polymerointi vaikuttaa tuoksun pidättymisaikaan savimateriaaleissa ja onko tuoksun vapautuminen polymeroinnin jälkeen kontrolloitua. Ennen tuoksumolekyylien imeytymis- ja vapautumistutkimusta tavoitteena oli löytää toistettava pintainitioitu atominsiirtoradikaalipolymerointi-synteesimenetelmä (SI-ATRP) savi/polymeerikomposiittien valmistamiseksi. Savi/PDMAEMA-komposiitit valmistettiin syntetisoimalla ensin savi/aminosilaanikomposiitteja, joihin liitettiin initiaattori 2-bromoisobutyryylibromidi. DMAEMA:n polymerointi suoritettiin savi/initiaattori-komposiittien pintaan ‘grafting from’ -tekniikalla käyttäen ATRP-menetelmää. Savina käytettiin montmorilloniittia, halloisiittia ja wollastoniittia, joista montmorilloniitti- ja halloisiitti/PDMAEMA-komposiittien syntetisoinnissa onnistuttiin. Lähtöainesavien rakenne ja dimensiot tutkittiin kuvaamalla ne kenttäemissiopyyhkäisy-elektronimikroskoopilla. Väli- ja lopputuotteiden rakenteet karakterisoitiin IR- ja 1H-NMR-spektrometrisesti sekä termogravimetrisesti. Imeytyneen/vapautuneen appelsiiniöljyn määrä ja öljyn pysyminen savissa ja savi/PDMAEMA-komposiiteissa todennettiin termogravimetrisesti (dynaamisilla ja isotermisilla TGA-määrityksillä). Polymeerien ketjunpituudet ja polydispersiteetit määritettiin kokoekskluusiokromatografisesti (GPC). Savi/PDMAEMA-komposiittien valmistamiseksi löydettiin toimiva SI-ATRP-menetelmä, ja polymeeriketjujen kiinnittyminen saven pintaan todistettiin. Tutkimus osoitti PDMAEMA-oksastuksen vaikuttavan appelsiiniöljyn imeytymis- ja vapautumisominaisuuksiin siten, että öljyä imeytyi oksastettuun komposiittiin enemmän ja se pysyi komposiittimateriaalissa kauemmin verrattuna puhtaisiin saviin. Vaikka tutkimuksen tulokset osoittivat polymeroinnin merkittävän hyödyn tuoksuominaisuuksien parantamisessa, täysin kontrolloitua vapautumissysteemiä ei onnistuttu luomaan.
Scalable and High Available Kubernetes Cluster in Edge Environments for IoT Applications

Hyeongju, Lee (2021)

The number of IoT and sensor devices is expected to reach 25 billion by 2030. Many IoT appli- cations, such as connected vehicle and smart factory that require high availability, scalability, low latency, and security have appeared in the world. There have been many attempts to use cloud computing for IoT applications, but the mentioned requirements cannot be ensured in cloud environments. To solve this problem, edge computing has appeared in the world. In edge environments, containerization technology is useful to deploy apps with limited resources. In this thesis, two types of high available Kubernetes architecture (2 nodes with an external DB and 3 nodes with embedded DB) were surveyed and implemented using K3s distribution that is suitable for edges. By having a few experiments with the implemented K3s clusters, this thesis shows that the K3s clusters can provide high availability and scalability. We discuss the limitations of the implementations and provide possible solutions too. In addition, we provide the resource usages of each cluster in terms of CPU, RAM, and disk. Both clusters need only less than 10% CPU and about 500MB RAM on average. However, we could see that the 3 nodes cluster with embedded DB uses more resources than the 2 nodes + external DB cluster when changing the status of clusters. Finally, we show that the implemented K3s clusters are suitable for many IoT applications such as connected vehicle and smart factory. If an application that needs high availability and scalability has to be deployed in edge environments, the K3s clusters can provide good solutions to achieve the goals of the applications. The 2 nodes + external DB cluster is suitable for the applications where the amount of data fluctuate often, or where there is a stable connection with the external DB. On the other hand, the 3 nodes cluster will be suitable for the applications that need high availability of the database even in poor internet connection. ACM Computing Classification System (CCS) Computer systems organization → Embedded and cyber-physical systems Human-centered computing → Ubiquitous and mobile computing
Scalable and High Available Kubernetes Cluster in Edge Environments for IoT Applications

Lee, Hyeongju (2021)

The number of IoT and sensor devices is expected to reach 25 billion by 2030. Many IoT appli- cations, such as connected vehicle and smart factory that require high availability, scalability, low latency, and security have appeared in the world. There have been many attempts to use cloud computing for IoT applications, but the mentioned requirements cannot be ensured in cloud environments. To solve this problem, edge computing has appeared in the world. In edge environments, containerization technology is useful to deploy apps with limited resources. In this thesis, two types of high available Kubernetes architecture (2 nodes with an external DB and 3 nodes with embedded DB) were surveyed and implemented using K3s distribution that is suitable for edges. By having a few experiments with the implemented K3s clusters, this thesis shows that the K3s clusters can provide high availability and scalability. We discuss the limitations of the implementations and provide possible solutions too. In addition, we provide the resource usages of each cluster in terms of CPU, RAM, and disk. Both clusters need only less than 10% CPU and about 500MB RAM on average. However, we could see that the 3 nodes cluster with embedded DB uses more resources than the 2 nodes + external DB cluster when changing the status of clusters. Finally, we show that the implemented K3s clusters are suitable for many IoT applications such as connected vehicle and smart factory. If an application that needs high availability and scalability has to be deployed in edge environments, the K3s clusters can provide good solutions to achieve the goals of the applications. The 2 nodes + external DB cluster is suitable for the applications where the amount of data fluctuate often, or where there is a stable connection with the external DB. On the other hand, the 3 nodes cluster will be suitable for the applications that need high availability of the database even in poor internet connection. ACM Computing Classification System (CCS) Computer systems organization → Embedded and cyber-physical systems Human-centered computing → Ubiquitous and mobile computing
Scalable Bayesian Induction of Word Embeddings

Sakaya, Joseph Hosanna (2015)

Traditional natural language processing has been shown to have excessive reliance on human-annotated corpora. However, the recent successes of machine translation and speech recognition, ascribed to the effective use of the increasingly availability of web-scale data in the wild, has given momentum to a re-surging interest in attempting to model natural language with simple statistical models, such as the n-gram model, that are easily scaled. Indeed, words and word combinations provide all the representational machinery one needs for solving many natural language tasks. The degree of semantic similarity between two words is a function of the similarity of the linguistic contexts in which they appear. Word representations are mathematical objects, often vectors, that capture syntactic and semantic properties of a word. This results in words that are semantic cognates having similar word representations, an important property that we will widely use. We claim that word representations provide a superb framework for unsupervised learning on unlabelled data by compactly representing the distributional properties of words. The current state-of-the-art word representation adopts the skip-gram model to train shallow neural networks and presents negative sampling, an idea borrowed from Noise Contrastive Estimation, as an efficient method of inducing embeddings. An alternative approach contends that the inherent multi-contextual nature of words entails a more Canonical Correlation Analysis-like approach for best results. In this thesis we develop the first fully Bayesian model to induce word embeddings. The prominent contributions of this thesis are: 1. A crystallisation of the best practices from previous literature on word embeddings and matrix factorisation into a single hierarchical Bayesian model. 2. A scalable matrix factorisation technique for structured sparse data. 3. Representation of the latent dimensions as continuous Gaussian densities instead of as point estimates. We analyse a corpus of 170 million tokens and learn for each word form a vectorial representation based on the 8 surrounding context words with a negative sampling rate of 2 per token. We would like to stress that while we certainly hope to beat the state-of-the-art, our primary goal is to develop a stochastic and scalable Bayesian model. We evaluate the quality of the word embeddings against the word analogy tasks as well as other such tasks as word similarity and chunking. We demonstrate competitive performance on standard benchmarks.
Schizophrenic graft copolymers and their stimuli-responsive self-assembly behavior

Jiang, Tao (2017)

Dually thermoresponsive poly(sulfobetaine methacylate)-graft-(poly(poly(ethylene glycol) methyl ether methacrylate)-co-poly(di(ethylene glycol) methyl ether methacrylate) were synthesized via single electron transfer living radical polymerization (SET-LRP). Two different such graft copolymers S70-g-P25D25 and S70-g-P70D280 with different side chain lengths were prepared and studied. These polymers showed 'schezophrenic' self-assembly behavior in response to temperature and ionic strength in aqeuous solution in water. S70-g-P25D25 formed nanostructures at temperatures both above the lower critical solution temperature (LCST) and below the upper critical solution temperature (UCST) with inversed core-shell nature in aqueous solution. Under saline condition no nano structure could be observed at temperatures below the UCST. For S70-g-P70D280, LCST type self-assembly was observed with the formation of similar nanostructures, but at temperatures below UCST, instead of intermolecular aggregation, unimolecular self-assembly was obsverved due to the much more crowded side chains.
School segregation and declining educational outcomes : An analysis of urban and school segregation and the possibility of neighbourhood effects in upper comprehensive schools in Helsinki

Suomalainen, Aino (2020)

This Master’s thesis studies the mechanisms connected to negative changes in educational outcomes in upper comprehensive schools in Helsinki. What are the factors associated with negative changes in educational outcomes of individual students during the transition from 7th to 9th grade? There is an increased socioeconomic and ethnic segregation in Helsinki Metropolitan Area, and the differences between schools’ levels of success have also been growing throughout the 21st century. There is little research on combining schools and city development in Finland. The aim is to examine is there an association between decreasing individual educational outcomes and socio-spatial or school segregation, and to look at what is the role of individual factors and social context in decreased educational outcomes. Studying pupils and schools is a good way to capture local processes of differentiation and neighbourhood effect, because children and youth are especially prone to neighbourhood and school effects due to their ongoing process of socialization, localized lives in their neighbourhood and shared institutions, such as school. This study is conducted quantitatively, and the main method in this study is hierarchical linear regression. The data is from Metropolitan Longitudinal Finland research, which studies the success and wellbeing of pupils in upper comprehensive schools in the Helsinki Metropolitan area. The study was conducted during the Fall of 2011 and the Spring of 2014 tracking the same cohort when the pupils were in their 7th and 9th grades. The results suggest that there are no differences found between schools, but some of the qualities describing neighborhoods indicate that some neighbourhood effect might be found. There are indications that pupils with decreased educational outcomes are more likely to study in schools that are located in low income areas than higher income areas. Also, for pupils with decreased educational outcomes, attending a school that is located in Northern or Southeastern Great districts is more likely than attending a school in Eastern Great district. Based on the results, pupils with negative change in educational outcomes are more likely to spend time with friends of own area than with school friends. Boys have a bigger risk for a negative change in educational outcomes than girls, and the change of school is connected to decreased educational outcomes. Mother’s education and immigration background was not found to have connection with decreased educational outcomes. Decreased educational outcomes have a connection with a low parents’ pedagogical ethos, but no connection with peers’ pedagogical ethos was found. The results are significant from the perspective of urban and educational politics and planning. The indications that the educational outcomes in upper comprehensive schools in Helsinki are differentiated in neighborhood level for example between Great districts, and in individual level between genders, challenge the goals of equal educational opportunities. Also, urban planning should be targeted to prevent socio-spatial differentiation of neighborhoods, in order to combat differentiation in schools’ composition of pupils. In future research, the starting level of educational success could be studied more closely- does decrease in educational outcomes implicate different educational paths for pupils that start with high starting level than pupils that have lower starting level in the beginning? This study provided information that there are no differences between schools found currently, but the processes of differentiation are not stable, so the processes should be observed continuously.
Schwarz-Christoffelin kaava

Jalkanen, Matias (2018)

Tämän tutkielman tarkoitus on perehtyä analyyttisten funktioiden konformisuuteen ja erityisesti esittää Schawrz-Christoffelin kaava, joka on konformikuvaus ylemmältä puolitasolta yksinkertaiselle monikulmiolle. Tutkielma on jaettu kahteen osaan, joista ensimmäinen on Riemannin kuvauslause. Kappale on rakennettu niin, että perehdytään ensin analyyttisten funktioiden ominaisuuksiin, joita Riemannin kuvauslauseen todistuksessa tarvitaan. Näitä ovat logaritmin ominaisuudet, funktiojonon normaali suppeneminen ja konformisuus. Riemannin kuvauslauseen todistamisen jälkeen aloitetaan perehtymään Schwarz-Christoffelin kaavaan. Kappaleessa aloitetaan helposti ymmärrettävällä esimerkillä Schwarz-Christoffelin kaavan ideasta, jota kehitetään itse kaavaksi. Loppuosassa osoitetaan, että kaava on itseasiassa konforminen. Tämä vaatii tarkkaa analyysiä funktion käytöksestä kuvauksen reunalla.

Now showing items 3062-3081 of 4026

Browsing by Title

Yhteystiedot

HELSINGIN YLIOPISTO