Browsing by master's degree program "Master 's Programme in Mathematics and Statistics"

Now showing items 1-20 of 73

A Bayesian Approach to Non-convex Asteroid Shape Determination

Huggins, Robert (2023)

In this thesis, we develop a Bayesian approach to the inverse problem of inferring the shape of an asteroid from time-series measurements of its brightness. We define a probabilistic model over possibly non-convex asteroid shapes, choosing parameters carefully to avoid potential identifiability issues. Applying this probabilistic model to synthetic observations and sampling from the posterior via Markov Chain Monte Carlo, we show that the model is able to recover the asteroid shape well in the limit of many well-separated observations, and is able to capture posterior uncertainty in the case of limited observations. We greatly accelerate the computation of the forward problem (predicting the measured light curve given the asteroid’s shape parameters) by using a bounding volume hierarchy and by exploiting data parallelism on a graphics processing unit.
Aihemallinnus Suomi24-keskustelufoorumin aineistolla toimeentulotukilain muutokseen liittyvissä keskusteluissa

Nenonen, Veera (2022)

Sosiaalietuudet ovat kokeneet monenlaisia muutoksia vuosien aikana, ja niihin liittyviä lakeja pyritään kehittämään jatkuvasti. Myös aivan viimesijaiseen valtion tarjoamaan taloudellisen tuen muotoon, toimeentulotukeen, on kohdistettu merkittäviä toimenpiteitä, mikä on vaikuttanut useiden suomalaisten elämään. Näistä toimenpiteistä erityisesti perustoimeentulotuen siirtäminen Kansaneläkelaitoksen vastuulle on vaatinut paljon sopeutumiskykyä tukea käsitteleviltä ja hakevilta tahoilta. Tämä on voinut herättää voimakkaitakin mielipiteitä, joiden ilmaisuun keskustelufoorumit ovat otollinen alusta. Suomen suurin keskustelufoorumi Suomi24 sisältää paljon yhteiskuntaan ja politiikkaan liittyviä keskusteluketjuja, joiden sisällön kartoittaminen kiinnostaviin aiheisiin liittyen voi tuottaa oikeanlaisilla menetelmillä mielenkiintoista ja hyödyllistä tietoa. Tässä tutkielmassa pyritään luonnollisen kielen prosessoinnin menetelmiä, tarkemmin aihemallinnusta, hyödyntämällä selvittämään, onko vuonna 2017 voimaan tulleen toimeentulotukilain muutos mahdollisesti näkynyt jollakin tavalla Suomi24-foorumin toimeentulotukea käsittelevissä keskusteluissa. Tutkimus toteutetaan havainnollistamalla valittua aineistoa erilaisilla visualisoinneilla sekä soveltamalla LDA algoritmia, ja näiden avulla yritetään havaita keskusteluiden keskeisimmät aiheet ja niihin liittyvät käsitteet. Jos toimeentulotukilain muutos on herättänyt keskustelua, se voisi ilmetä aiheista sekä niiden sisältämien sanojen käytön jakautumisesta ajalle ennen muutosta ja sen jälkeen. Myös aineiston rajaus ja poiminta tietokannasta, sekä aineiston esikäsittely aihemallinnusta varten kattaa merkittävän osan tutkimuksesta. Aineistoa testataan yhteensä kaksi kertaa, sillä ensimmäisellä kerralla havaitaan puutteita esikäsittelyvaiheessa sekä mallin sovittamisessa. Iterointi ei ole epätavanomaista tällaisissa tutkimuksissa, sillä vasta tuloksia tulkitessa saattaa nousta esille asioita, jotka olisi pitänyt ottaa huomioon jo edeltävissä vaiheissa. Toisella testauskerralla aiheiden sisällöistä nousi esille joitain mielenkiintoisia havaintoja, mutta niiden perusteella on vaikea tehdä päätelmiä siitä, näkyykö toimeentulotukilain muutos keskustelualustan viesteistä.
A multistate analysis of ulcerative colitis and colorectal cancer

Tanskanen, Tomas (2022)

Colorectal cancer (CRC) accounts for one in 10 new cancer cases worldwide. CRC risk is determined by a complex interplay of constitutional, behavioral, and environmental factors. Patients with ulcerative colitis (UC) are at increased risk of CRC, but effect estimates are heterogeneous, and many studies are limited by small numbers of events. Furthermore, it has been challenging to distinguish the effects of age at UC diagnosis and duration of UC. Multistate models provide a useful statistical framework for analyses of cancers and premalignant conditions. This thesis has three aims: to review the mathematical and statistical background of multistate models; to study maximum likelihood estimation in the illness-death model with piecewise constant hazards; and to apply the illness-death model to UC and CRC in a population-based cohort study in Finland in 2000–2017, considering UC as a premalignant state that may precede CRC. A likelihood function is derived for multistate models under noninformative censoring. The multistate process is considered as a multivariate counting process, and product integration is reviewed. The likelihood is constructed by partitioning the study time into subintervals and finding the limit as the number of subintervals tends to infinity. Two special cases of the illness-death model with piecewise constant hazards are studied: a simple Markov model and a non-Markov model with multiple time scales. In the latter case, the likelihood is factorized into terms proportional to Poisson likelihoods, which permits estimation with standard software for generalized linear models. The illness-death model was applied to study the relationship between UC and CRC in a population-based sample of 2.5 million individuals in Finland in 2000–2017. Dates of UC and CRC diagnoses were obtained from the Finnish Care Register for Health Care and the Finnish Cancer Registry, respectively. Individuals with prevalent CRC were excluded from the study cohort. Individuals in the study cohort were followed from January 1, 2000, to the date of first CRC diagnosis, death from other cause, emigration, or December 31, 2017, whichever came first. A total of 23,533 incident CRCs were diagnosed during 41 million person-years of follow-up. In addition to 8,630 patients with prevalent UC, there were 19,435 cases of incident UC. Of the 23,533 incident CRCs, 298 (1.3%) were diagnosed in patients with pre-existing UC. In the first year after UC diagnosis, the HR for incident CRC was 4.67 (95% CI: 3.07, 7.09) in females and 7.62 (95% CI: 5.65, 10.3) in males. In patients with UC diagnosed 1–3 or 4–9 years earlier, CRC incidence did not differ from persons without UC. When 10–19 years had passed from UC diagnosis, the HR for incident CRC was 1.63 (95% CI: 1.19, 2.24) in females and 1.29 (95% CI: 0.96, 1.75) in males, and after 20 years, the HR was 1.61 (95% CI: 1.13, 2.31) in females and 1.74 (95% CI: 1.31, 2.31) in males. Early-onset UC (age <40 years) was associated with a markedly increased long-term risk of CRC. The HR for CRC in early-onset UC was 4.13 (95% CI: 2.28, 7.47) between 4–9 years from UC diagnosis, 4.88 (95% CI: 3.46, 6.88) between 10–19 years, and 2.63 (95% CI: 2.01, 3.43) after 20 years. In this large population-based cohort study, we estimated CRC risk in persons with and without UC in Finland in 2000–2017, considering both the duration of UC and age at UC diagnosis. Patients with early-onset UC are at increased risk of CRC, but the risk is likely to depend on disease duration, extent of disease, attained age, and other risk factors. Increased CRC risk in the first year after UC diagnosis may be in part due to detection bias, whereas chronic inflammation may underlie the long-term excess risk of CRC in patients with UC.
ARMA-mallien robusti estimointi

Aholainen, Kusti (2022)

Tämän tutkielman tarkoitus on tarkastella robustien estimaattorien, erityisesti BMM- estimaattorin, soveltuvuutta ARMA(p, q)-prosessin parametrien estimointiin. Robustit estimaattorit ovat estimaattoreita, joilla pyritään hallitsemaan poikkeavien havaintojen eli outlierien vaikutusta estimaatteihin. Robusti estimaattori sietääkin outliereita siten, että outlierien läsnäololla havainnoissa ei ole merkittävää vaikutusta estimaatteihin. Outliereita vastaan saatu suoja kuitenkin yleensä näkyy menetettynä tehokkuutena suhteessa suurimman uskottavuuden menetelmään. BMM-estimaattori on Mulerin, Peñan ja Yohain Robust estimation for ARMA models-artikkelissa (2009) esittelemä MM-estimaattorin laajennus. BMM-estimaattori pohjautuu ARMA-mallin apumalliksi kehitettyyn BIP-ARMA-malliin, jossa innovaatiotermin vaikutusta rajoitetaan suodattimella. Ajatuksena on näin kontrolloida ARMA-mallin innovaatioissa esiintyvien outlierien vaikutusta. Tutkielmassa BMM- ja MM- estimaattoria verrataan klassisista menetelmistä suurimman uskottavuuden (SU) ja pienimmän neliösumman (PNS) menetelmiin. Tutkielman alussa esitetään tarvittava todennäköisyysteorian, aikasarja-analyysin sekä robustien menetelmien käsitteistö. Lukija tutustutetaan robusteihin estimaattoreihin ja motivaatioon robustien menetelmien taustalla. Outliereita sisältäviä aikasarjoja käsitellään tutkielmassa asymptoottisesti saastuneen ARMA-prosessin realisaatioina ja keskeisimmille kirjallisuudessa tunnetuille outlier-prosesseille annetaan määritelmät. Lisäksi kuvataan käsiteltyjen BMM-, MM-, SU- ja PNS-estimaattorien laskenta. Estimaattorien yhteydessä käsitellään lisäksi alkuarvomenetelmiä, joilla estimaattorien minimointialgoritmien käyttämät alkuarvot valitaan. Tutkielman teoriaosuudessa esitetään lauseet ja todistukset MM-estimaattorin tarkentuvuudesta ja asymptoottisesta normaaliudesta. Kirjallisuudessa ei kuitenkaan tunneta todistusta BMM-estimaattorin vastaaville ominaisuuksille, vaan samojen ominaisuuksien otaksutaan pätevän myös BMM-estimaattorille. Tulososuudessa esitetään simulaatiot, jotka toistavat Muler et al. artikkelissa esitetyt simulaatiot monimutkaisemmille ARMA-malleille. Simulaatioissa BMM- ja MM-estimaattoria verrataan keskineliövirheen suhteen SU- ja PNS-estimaattoreihin, verraten samalla eri alkuarvomenetelmiä samalla. Lisäksi estimaattorien asymptoottisia robustiusominaisuuksia käsitellään. Estimaattorien laskenta on toteutettu R- ohjelmistolla, missä BMM- ja MM-estimaattorien laskenta on toteutettu pääosin C++-kielellä. Liite käsittää BMM- ja MM- estimaattorien laskentaan tarvittavan lähdekoodin.
Axiomatizing modal inclusion logic

Häggblom, Matilda (2022)

Modal inclusion logic is modal logic extended with inclusion atoms. It is the modal variant of first-order inclusion logic, which was introduced by Galliani (2012). Inclusion logic is a main variant of dependence logic (Väänänen 2007). Dependence logic and its variants adopt team semantics, introduced by Hodges (1997). Under team semantics, a modal (inclusion) logic formula is evaluated in a set of states, called a team. The inclusion atom is a type of dependency atom, which describes that the possible values a sequence of formulas can obtain are values of another sequence of formulas. In this thesis, we introduce a sound and complete natural deduction system for modal inclusion logic, which is currently missing in the literature. The thesis consists of an introductory part, in which we recall the definitions and basic properties of modal logic and modal inclusion logic, followed by two main parts. The first part concerns the expressive power of modal inclusion logic. We review the result of Hella and Stumpf (2015) that modal inclusion logic is expressively complete: A class of Kripke models with teams is closed under unions, closed under k-bisimulation for some natural number k, and has the empty team property if and only if the class can be defined with a modal inclusion logic formula. Through the expressive completeness proof, we obtain characteristic formulas for classes with these three properties. This also provides a normal form for formulas in MIL. The proof of this result is due to Hella and Stumpf, and we suggest a simplification to the normal form by making it similar to the normal form introduced by Kontinen et al. (2014). In the second part, we introduce a sound and complete natural deduction proof system for modal inclusion logic. Our proof system builds on the proof systems defined for modal dependence logic and propositional inclusion logic by Yang (2017, 2022). We show the completeness theorem using the normal form of modal inclusion logic.
Can a neural network recognize seasons? Solving a classification problem based on weather observations

Kukkola, Johanna (2022)

Can a day be classified to the correct season on the basis of its hourly weather observations using a neural network model, and how accurately can this be done? This is the question this thesis aims to answer. The weather observation data was retrieved from Finnish Meteorological Institute’s website, and it includes the hourly weather observations from Kumpula observation station from years 2010-2020. The weather observations used for the classification were cloud amount, air pressure, precipitation amount, relative humidity, snow depth, air temperature, dew-point temperature, horizontal visibility, wind direction, gust speed and wind speed. There are four distinct seasons that can be experienced in Finland. In this thesis the seasons were defined as three-month periods, with winter consisting of December, January and February, spring consisting of March, April and May, summer consisting of June, July and August, and autumn consisting of September, October and November. The days in the weather data were classified into these seasons with a convolutional neural network model. The model included a convolutional layer followed by a fully connected layer, with the width of both layers being 16 nodes. The accuracy of the classification with this model was 0.80. The model performed better than a multinomial logistic regression model, which had accuracy of 0.75. It can be concluded that the classification task was satisfactorily successful. An interesting finding was that neither models ever confused summer and winter with each other.
Cash flow simulation and model comparison in private credit

Virtanen, Jussi (2022)

In the thesis we assess the ability of two different models to predict cash flows in private credit investment funds. Models are a stochastic type and a deterministic type which makes them quite different. The data that has been obtained for the analysis is divided in three subsamples. These subsamples are mature funds, liquidated funds and all funds. The data consists of 62 funds, subsample of mature funds 36 and subsample of liquidated funds 17 funds. Both of our models will be fitted for all subsamples. Parameters of the models are estimated with different techniques. The parameters of the Stochastic model are estimated with the conditional least squares method. The parameters of the Yale model are estimated with the numerical methods. After the estimation of the parameters, the values are explained in detail and their effect on the cash flows are investigated. This helps to understand what properties of the cash flows the models are able to capture. In addition, we assess to both models' ability to predict cash flows in the future. This is done by using the coefficient of determination, QQ-plots and comparison of predicted and observed cumulated cash flows. By using the coefficient of determination we try to explain how well the models explain the variation around the residuals of the observed and predicted values. With QQ-plots we try to determine if the values produced of the process follow the normal distribution. Finally, with the cumulated cash flows of contributions and distributions we try to determine if models are able to predict the cumulated committed capital and returns of the fund in a form of distributions. The results show that the Stochastic model performs better in its prediction of contributions and distributions. However, this is not the case for all the subsamples. The Yale model seems to do better in cumulated contributions of the subsample of the mature funds. Although, the flexibility of the Stochastic model is more suitable for different types of cash flows and subsamples. Therefore, it is suggested that the Stochastic model should be the model to be used in prediction and modelling of the private credit funds. It is harder to implement than the Yale model but it does provide more accurate results in its prediction.
Combinatorial Symmetries in Knot Theory : Yamada Polynomials from Transfer-Matrix Methods

Lundström, Teemu (2022)

Spatial graphs are graphs that are embedded in three-dimensional space. The study of such graphs is closely related to knot theory, but it is also motivated by practical applications, such as the linking of DNA and the study of chemical compounds. The Yamada polynomial is one of the most commonly used invariants of spatial graphs as it gives a lot of information about how the graphs sit in the space. However, computing the polynomial from a given graph can be computationally demanding. In this thesis, we study the Yamada polynomial of symmetrical spatial graphs. In addition to being symmetrical, the graphs we study have a layer-like structure which allows for certain transfer-matrix methods to be applied. There the idea is to express the polynomial of a graph with n layers in terms of graphs with n − 1 layers. This then allows one to obtain the polynomial of the original graph by computing powers of the so-called transfer-matrix. We introduce the Yamada polynomial and prove various properties related to it. We study two families of graphs and compute their Yamada polynomials. In addition to this, we introduce a new notational technique which allows one to ignore the crossings of certain spatial graphs and turn them into normal plane graphs with labelled edges. We prove various results related to this notation and show how it can be used to obtain the Yamada polynomial of these kinds of graphs. We also give a sketch of an algorithm with which one could, at least in principle, obtain the Yamada polynomials of larger families of graphs.
Comparing approximative inference methods for multi-class Gaussian process classification

Ronkainen, Arttu (2023)

Gaussiset prosessit ovat stokastisia prosesseja, joiden äärelliset osajoukot noudattavat multinormaa- lijakaumaa. Niihin pohjautuvien mallien käyttö on suosittua bayesiläisessä tilastotieteessä, sillä ne mahdollistavat monimutkaisten ajallisten tai avaruudellisten riippuvuuksien joustavan mallintami- sen. Gaussisen latentin muuttujan malleissa havaintojen oletetaan noudattavan ehdollista jakaumaa, joka riippuu priorijakaumaltaan gaussisen latentin prosessin saamista arvoista. Havaintoaineiston koostuessa kategorisista arvoista, ovat gaussisen latentin muuttujan mallit ovat laskennallisesti han- kalia, sillä latenttien muuttujien posteriorijakaumaa ei yleensä voida käsitellä analyyttisesti. Täl- löin posterioripäättelyyn on käytettävä analyyttisiä approksimaatioita tai numeerisia menetelmiä. Laskennalliset hankaluudet korostuvat entisestään, kun latentin gaussisen muuttujan kovarianssi- funktion parametreille asetetaan oma priorijakauma. Tässä työssä käsitellään approksimatiivisia menetelmiä, joita voidaan käyttää posterioripäättelyyn gaussisen latentin muuttujan malleissa. Työssä keskitytään pääasiallisesti usean luokan luokitte- lumalliin, jossa havaintomallina on softmax-funktio, mutta suuri osa esitellyistä ideoista on so- vellettavissa myös muille havaintomalleille. Tutkielmassa käsiteltäviä approksimatiivisia menetel- miä on kolme. Ensimmäinen menetelmä on bayesiläisessä tilastotieteessä usein käytetty, satunnai- sotantaan perustuva Markovin ketju Monte Carlo -menetelmä, joka on asymptoottisesti eksakti, mutta laskennallisesti raskas. Toinen menetelmä käyttää Laplace-approksimaatioksi kutsuttua ana- lyyttista approksimaatiota latentin muuttujan posteriorille, yhdessä Markovin ketju Monte Carlo -menetelmän kanssa. Kolmas menetelmä yhdistää Laplace-approksimaation ja hyperparametrien piste-estimoinnin. Käsiteltävien menetelmien perustana oleva teoria esitellään tutkielmassa mini- maalisesti, jonka jälkeen approksimatiivisten menetelmien suoriutumista vertaillaan usean luokan luokittelumallis- sa simuloidulla havaintoaineistolla. Vertailussa voidaan havaita Laplace-approksimaation vaikutus hyperparametrien, sekä latentin muuttujan posteriorijakaumiin.
Comparison of high-dimensional Bayesian variable selection methods with application in genetics

Laiho, Aleksi (2022)

In statistics, data can often be high-dimensional with a very large number of variables, often larger than the number of samples themselves. In such cases, selection of a relevant configuration of significant variables is often needed. One such case is in genetics, especially genome-wide association studies (GWAS). To select the relevant variables from high-dimensional data, there exists various statistical methods, with many of them relating to Bayesian statistics. This thesis aims to review and compare two such methods, FINEMAP and Sum of Single Effects (SuSiE). The methods are reviewed according to their accuracy of identifying the relevant configurations of variables and their computational efficiency, especially in the case where there exists high inter-variable correlations within the dataset. The methods were also compared to more conventional variable selection methods, such as LASSO. The results show that both FINEMAP and SuSiE outperform LASSO in terms of selection accuracy and efficiency, with FINEMAP producing sligthly more accurate results with the expense of computation time compared to SuSiE. These results can be used as guidelines in selecting an appropriate variable selection method based on the study and data.
Comparison of the Feedforward and Operator Recurrent Neural Networks for the Inverse Problem for the Wave Equation

Mäkinen, Sofia (2023)

In this thesis we consider the inverse problem for the one-dimensional wave equation. That is, we would like to recover the velocity function, the wave speed, from the equation given Neumann and Dirichlet boundary conditions, when the solution to the equation is known. It has been shown that an operator Λ corresponding to the boundary conditions determines the volumes of the domain of influence, which is the set where the travel time for the wave is limited. These volumes then in turn determine the velocity function. We present some theorems and propositions about determining the wave speed and present proofs for a few of them. Artificial neural networks are a form of machine learning widely used in various applications. It has been previously proven that a one-layer feedforward neural network with a non-polynomial activation function with some additional constraints on the activation function can approximate any continuous real valued functions. In this thesis we present proof of this result for a continuous non-polynomial activation function. Furthermore, in this thesis we apply two neural network architectures to the volume inversion problem, which means that we train the networks to approximate a single volume when the operator Λ is given. The neural networks in question are the feedforward neural network and the operator recurrent neural network. Before the volume inversion problem, we consider a simpler problem of finding an inverse matrix of a small invertible matrix. Finally, we compare the performances of these two neural networks for both the volume and matrix inversion problems.
Descriptive combinatorics and distributed algorithms

Särkijärvi, Joona (2023)

Both descriptive combinatorics and distributed algorithms are interested in solving graph problems with certain local constraints. This connection is not just superficial, as Bernshteyn showed in his seminal 2020 paper. This thesis focuses on that connection by restating the results of Bernshteyn. This work shows that a common theory of locality connects these fields. We also restate the results that connect these findings to continuous dynamics, where they found that solving a colouring problem on the free part of the subshift 2^Γ is equivalent to there being a fast LOCAL algorithm solving this problem on finite sections of the Cayley graph of Γ. We also restate the result on the continuous version of Lovász Local Lemma by Bernshteyn. The LLL is a powerful probabilistic tool used throughout combinatorics and distributed computing. They proved a version of the lemma that, under certain topological constraints, produces continuous solutions.
Developmental trajectories of height, weight and BMI across childhood : Bayesian hierarchical modeling of longitudinal data

Kauppala, Tuuli (2021)

Children’s height and weight development remains a subject of interest especially due to increasing prevalence of overweight and obesity in the children. With statistical modeling, height and weight development can be examined as separate or connected outcomes, aiding with understanding of the phenomenon of growth. As biological connection between height and weight development can be assumed, their joint modeling is expected to be beneficial. One more advantage of joint modeling is its convenience of the Body Mass Index (BMI) prediction. In the thesis, we modeled longitudinal data of children’s heights and weights of the dataset obtained from Finlapset register of the Institute of Health and Welfare (THL). The research aims were to predict the modeled quantities together with the BMI, interpret the obtained parameters with relation to the phenomenon of growth, as well as to investigate the impact of municipalities on to the growth of children. The dataset’s irregular, register-based nature together with positively skewed, heteroschedastic weight distributions and within- and between-subject variability suggested Hierarchical Linear Models (HLMs) as the modeling method of choice. We used HLMs in Bayesian setting with the benefits of incorporating existing knowledge, and obtaining full posterior predictive distribution for the outcome variables. HLMs were compared with the less suitable classical linear regression model, and bivariate and univariate HLMs with or without area as a covariate were compared in terms of their posterior predictive precision and accuracy. One of the main research questions was the model’s ability to predict the BMI of the child, which we assessed with various posterior predictive checks (PPC). The most suitable model was used to estimate growth parameters of 2-6 year old males and females in Vihti, Kirkkonummi and Tuusula. With the parameter estimates, we could compare growth of males and females, assess the differences of within-subject and between-subject variability on growth and examine correlation between height and weight development. Based on the work, we could conclude that the bivariate HLM constructed provided the most accurate and precise predictions especially for the BMI. The area covariates did not provide additional advantage to the models. Overall, Bayesian HLMs are a suitable tool for the register-based dataset of the work, and together with log-transformation of height and weight they can be used to model skewed and heteroschedastic longitudinal data. However, the modeling would ideally require more observations per individual than we had, and proper out-of-sample predictive evaluation would ensure that current models are not over-fitted with regards to the data. Nevertheless, the built models can already provide insight into contemporary Finnish childhood growth and to simulate and create predictions for the future BMI population distributions.
Dirichlet'n ongelma C^2:n hyperbolisessa yksikkökuulassa

Frosti, Miika (2022)

Tämä tutkielma käsittelee C^2:n hyperbolisessa yksikkökuulassa asetettuja Dirichlet'n ongelmia. Työn tavoitteena on löytää ongelman ratkaisujen joukosta ne funktiot, jotka ovat sileitä, eli rajattomasti derivoituvia. Tätä varten kuvaillaan aluksi R^2:n yksikköympyrässä ja puoliavaruudessa määritellyt Dirichlet'n ongelmat ja miten muodostaa niille ratkaisut. Molempien alueiden ongelmia varten luodaan aluekohtaiset Greenin funktiot, joiden avulla johdetaan Poissonin ydin. Tämän ytimen avulla saadaan sileä ratkaisu Dirichlet'n ongelmaan. Tämän jälkeen tutustutaan C^2:n hyperboliseen yksikkökuulaan, ja miten siinä määritellyt Dirichlet'n ongelmat eroavat R^2:n yksikkökuulan ongelmista. Aiheen kannalta merkittävintä on ero euklidisen ja hyperbolisen Laplace-Beltramin operaattorin ominaisuuksissa. Kun tärkeimmät eroavaisuudet ovat selvitetty, voidaan todistaa, että Poisson-Szegön ytimen avulla määritelty funktio ratkaisee Dirichlet'n ongelman. On kuitenkin mahdollista näyttää esimerkillä, että ratkaisut eivät ole välttämättä sileitä. Jotta näistä ratkaisuista voidaan erottaa sileät funktiot, on hyödynnettävä palloharmonisia funktioita. Näiden tärkeimpiä piirteitä kuvaillaan sekä reaaliavaruudessa että kompleksiavaruudessa. Näiden funktioiden ja hypergeometristen funktioiden avulla voidaan määritellä uusi muoto Poisson-Szegön ytimelle, josta voidaan puolestaan johtaa tutkielman lopputulos. Kyseiseksi lopputulokseksi saadaan se, että yksikkökuulan Dirichlet'n ongelmien ratkaisut ovat sileitä jos ja vain jos ratkaisut ovat pluriharmonisia.
Equilibrium in a Continuous-Time Overlapping Generations Model

Heikkuri, Vesa-Matti (2022)

This thesis studies equilibrium in a continuous-time overlapping generations (OLG) model. OLG models are used in economics to study the effect of demographics and life-cycle behavior on macroeconomic variables such as the interest rate and aggregate investment. These models are typically set in discrete time but continuous-time versions have also received attention recently for their desirable properties. Competitive equilibrium in a continuous-time OLG model can be represented as a solution to an integral equation. This equation is linear in the special case of logarithmic utility function. This thesis provides the necessary and sufficient conditions under which the linear equation is a convolution type integral equation and derives a distributional solution using Fourier transform. We also show that the operator norm of the integral operator is not generally less than one. Hence, the equation cannot be solved using Neumann series. However, in a special case the distributional solution is characterized by a geometric series on the Fourier side when the operator norm is equal to one.
Estimating missing company speciﬁc Intrastat export and import values with multivariate time series models

Lahdensuo, Sofia (2022)

The Finnish Customs collects and maintains the statistics of the Finnish intra-EU trade with the Intrastat system. Companies with significant intra-EU trade are obligated to give monthly Intrastat declarations, and the statistics of the Finnish intra-EU trade are compiled based on the information collected with the declarations. In case of a company not giving the declaration in time, there needs to exist an estimation method for the missing values. In this thesis we propose an automatic multivariate time series forecasting process for the estimation of the missing Intrastat import and export values. The forecasting is done separately for each company with missing values. For forecasting we use two dimensional time series models, where the other component is the import or export value of the company to be forecasted, and the other component is the import or export value of the industrial group of the company. To complement the time series forecasting we use forecast combining. Combined forecasts, for example the averages of the obtained forecasts, have been found to perform well in terms of forecast accuracy compared to the forecasts created by individual methods. In the forecasting process we use two multivariate time series models, the Vector Autoregressive (VAR) model, and a specific VAR model called the Vector Error Correction (VEC) model. The choice of the model is based on the stationary properties of the time series to be modelled. An alternative option for the VEC model is the so-called augmented VAR model, which is an over-fitted VAR model. We use the VEC model and the augmented VAR model together by using the average of the forecasts created with them as the forecast for the missing value. When the usual VAR model is used, only the forecast created by the single model is used. The forecasting process is created as automatic and as fast as possible, therefore the estimation of a time series model for a single company is made as simple as possible. Thus, only statistical tests which can be applied automatically are used in the model building. We compare the forecast accuracy of the forecasts created with the automatic forecasting process to the forecast accuracy of forecasts created with two simple forecasting methods. In the non-stationary-deemed time series the Naïve forecast performs well in terms of forecast accuracy compared to the time series model based forecasts. On the other hand, in the stationary-deemed time series the average over the past 12 months performs well as a forecast in terms of forecast accuracy compared to the time series model based forecasts. We also consider forecast combinations where the forecast combinations are created by calculating the average of the time series model based forecasts and the simple forecasts. In line with the literature, the forecast combinations perform overall better in terms of the forecast accuracy than the forecasts based on the individual models.
Estimating the distribution of rat population in Helsinki with non-stationary Gaussian model interpreting coastline as a physical barrier

Nikkanen, Leo (2022)

Often in spatial statistics the modelled domain contains physical barriers that can have impact on how the modelled phenomena behaves. This barrier can be, for example, land in case of modelling a fish population, or road for different animal populations. Common model that is used in spatial statistics is a stationary Gaussian model, because of its computational requirements, relatively easy interpretation of results. The physical barrier does not have an effect on this type of models unless the barrier is transformed into variable, but this can cause issues in the polygon selection. In this thesis I discuss how the non-stationary Gaussian model can be deployed in cases where spatial domain contains physical barriers. This non-stationary model reduces spatial correlation continuously towards zero in areas that are considered as a physical barrier. When the correlation is selected to reduce smoothly to zero, the model is more likely to results similar output with slightly different polygons. The advantage of the barrier model is that it is as fast to train as the stationary model because both models can be trained using finite equation method (FEM). With FEM we can solve stochastic partial differential equations (SPDE). This method interprets continuous random field as a discrete mesh, and the computational requirements increases as the number of nodes in mesh increases. In order to create stationary and non-stationary models, I have described the required methods such as Bayesian statistics, stochastic process, and covariance function in the second chapter. I use these methods to define spatial random effect model, and one commonly used spatial model is the Gaussian latent variable model. At the end of second chapter, I describe how the barrier model is created, and what types of requirements this model has. The barrier model is based on a Matern model that is a Gaussian random field, and it can be represented by using Matern covariance function. The second chapter ends with description of how to create a mesh mentioned above, and how the FEM is used to solve SPDE. The performance of stationary and non-stationary Gaussian models are first tested by training both models with simulated data. This simulated data is a random sample from polygon of Helsinki where the coastline is interpreted as a physical barrier. The results show that the barrier model estimates the true parameters better than the stationary model. The last chapter contains data analysis of the rat populations in Helsinki. The data contains number of rat observations in each zip code, and a set of covariates. Both models, stationary and non-stationary, are trained with and without covariates, and the best model out of these four models was the stationary model with covariates.
Evolution of pathogen virulence

Patieva, Fatima (2023)

In this thesis, we study epidemic models such as SIR and superinfection to demonstrate the coexistence as well as the competitive exclusion of all but one strain. We show that the strain that can keep its position under the worst environmental conditions cannot be invaded by any other strain when it comes to some models with a constant death rate. Otherwise, the optimization principle does not necessarily work. Nevertheless, Ackleh and Allen proved that in the SIR model with a density-dependent mortality rate and total cross-immunity the strain with the largest basic reproduction number is the winner in competitive exclusion. However, it must be taken into account that the conditions on the parameters used for the proof are sufficient but not necessary to exclude the coexistence of different pathogen strains. We show that the method can be applied to both density-dependent and frequency-dependent transmission incidence. In the latter half, we link the between and within-host models and expand the nested model to allow for superinfection. The introduction of the basic notions of adaptive dynamics contributes to simplifying our task of demonstrating the evolutionary branching leading to diverging dimorphism. The precise conclusions about the outcome of evolution will depend on the host demography as well as on the class of superinfection and the shape of transmission functions.
Extension of compact bilinear operators

Koutsompinas, Ioannis Jr (2021)

In this thesis we study extension results related to compact bilinear operators in the setting of interpolation theory and more specifically the complex interpolation method, as introduced by Calderón. We say that: 1. the bilinear operator T is compact if it maps bounded sets to sets of compact closure. 2.\bar{ A} = (A_0,A_1) is a Banach couple if A_0,A_1 are Banach spaces that are continuously embedded in the same Hausdorff topological vector space. Moreover, if (Ω,\mathcal{A}, μ) is a σ-finite measure space, we say that: 3. E is a Banach function space if E is a Banach space of scalar-valued functions defined on Ω that are finite μ-a.e. and so that the norm of E is related to the measure μ in an appropriate way. 4. the Banach function space E has absolutely continuous norm if for any function f ∈ E and for any sequence (Γ_n)_{n=1}^{+∞}⊂ \mathcal{A} satisfying χ_{Γn} → 0 μ-a.e. we have that ∥f · χ_{Γ_n}∥_E → 0. Assume that \bar{A} and \bar{B} are Banach couples, \bar{E} is a couple of Banach function spaces on Ω, θ ∈ (0, 1) and E_0 has absolutely continuous norm. If the bilinear operator T : (A_0 ∩ A_1) × (B_0 ∩ B_1) → E_0 ∩ E_1 satisfies a certain boundedness assumption and T : \tilde{A_0} × \tilde{B_0} → E_0 compactly, we show that T may be uniquely extended to a compact bilinear operator T : [A_0,A_1]_θ × [B_0,B_1]_θ → [E_0,E_1]_θ where \tilde{A_j} denotes the closure of A_0 ∩ A_1 in A_j and [A_0,A_1]_θ denotes the complex interpolation space generated by \bar{A}. The proof of this result comes after we study the case where the couple of Banach function spaces is replaced by a single Banach space.
Fukushiman hajotelmasta

Litmanen, Jenna (2023)

Tiivistelmä – Referat – Abstract Tässä työssä on tarkoituksena esittää Fukushiman hajotelma, jota voidaan käyttää yleistyksenä Itôn lemmalle. Ensimmäisessä luvussa käydään läpi perusteita stokastiselle analyysille. Työ etenee stokastisen analyysin perusteista Markovin prosesseihin ja tähän liittyviin käsitteisiin. Käydään läpi additiivisen funktionaalin käsite ja miten se liittyy käsiteltäviin prosesseihin. Martingaalien kohdalla käydään läpi peruskäsitteet. Tämän jälkeen siirrytään käsittelemään Itôn lemmaan ja tämän todistukseen. Itôn lemma on tärkeä työkalu taloustieteessä, etenkin kun työskennellään varallisuushintojen ja osakemarkkinoiden parissa. Itôn lemma luo pohja sille, kuinka varallisuushinnat voidaan määritellä Brownin liikkeen avulla. Samassa luvussa käsitellään myös muita hyödyllisiä stokastisen analyysin työkaluja. Yksi tällainen työkalu on Doobin-Meyer’n hajotelma martingaaleille ja ennustettavissa oleville prosesseille. Hajotelma on tärkeä työkalu, kun siirrytään korkeammalle tasolle stokastisten yhtälöiden kanssa. Ensimmäisen luvun lopussa käsitellään Sobolevin avaruutta, Dirichlet’n avaruutta ja Dirichlet’n muotoja. Näiden tarkoituksena on valmistaa lukijaa pohjatiedoiltaan seuraavaan lukuun, jossa käsitellään yhtä työn päälauseista Toisessa luvussa käsitellään additiivisen funktionaalin ja martingaaliadditiivisen funktionaalin energiaa ja Radon mittaa. Näiden käsittelyn jälkeen, siirrytään Itôn lemman yleistyksen pariin. Lopulta käsitellään yleistystä Itôn lemmalle. Yleistyksen pohjalla on mahdollisuus ottaa lauseesta “heikompi” versio, jolloin kaikkein vahvimpien ehtojen ja oletusten ei välttämättä tarvitse olla voimassa. Tämä on tärkeää, sillä Itôn lemman ehtona on jatkuvasti kahdesti differentioituvuus, joka ei läheskään aina toteudu stokastisissa prosesseissa. Näin ollen voidaan saavuttaa Itôn lemman edut kevyemmillä ehdoilla. Lopulta käsitellään Fukushiman hajotelmaa, joka on käytännöllinen prosesseille, jotka ovat semimartingaaleja. Fukushiman hajotelman avulla voidaan käsitellä tapauksia, joissa aiemmin käsiteltyjen lauseiden oletukset eivät täyty. Fukushiman hajotelma saadaan rakennettua aiemmin esitellyn lauseen avulla.

Now showing items 1-20 of 73

Browsing by master's degree program "Master 's Programme in Mathematics and Statistics"

Yhteystiedot

HELSINGIN YLIOPISTO