image

Contents

Preface

1 Introduction

1.1 Galaxy Formation

1.2 Decoding the Fossil Records: Photometric and Spectroscopic Diagnostics

1.3 Decoding the Fossil Record: the Tools

2 Low-Mass Star Physics

2.1 Basic Equations

2.2 The Thermodynamical Properties of Low-Mass Stars

2.3 Nuclear Energy Production and Nucleosynthesis

2.4 Radiative and Conductive Opacity

2.5 The Atmospheric Structure

2.6 Mass Loss

2.7 Diffusion Processes

2.8 Rotation

2.9 Internal Gravity Waves

2.10 Magnetic Fields

2.11 Thermohaline Mixing

3 From the Main Sequence to the Tip of the Red Giant Branch

3.1 Overview

3.2 Very Low-Mass Stars

3.3 The Main Sequence

3.4 The Standard Solar Model

3.5 Blue Stragglers

3.6 Subgiant Branch Evolution

3.7 Red Giant Branch Evolution

3.8 The Helium Flash

3.9 Theoretical Isochrones for the Hydrogen Burning Stages

4 Horizontal Branch and Red Clump

4.1 Overview

4.2 Mixing Processes

4.3 The Horizontal Branch Evolution

4.4 Extremely Hot Horizontal Branch and “Blue Hook” Stars

4.5 Low-Mass Pulsating Stars in the Core He-burning Stage

4.6 Red Clump Stars

4.7 The Effects of Non-canonical Processes on HB Evolution

5 Asymptotic Giant Branch

5.1 Overview

5.2 Early Asymptotic Giant Branch Evolution

5.3 The Thermally Pulsing Stage

5.4 Calibration of Evolutionary AGB Models

5.5 Synthetic AGB Stellar Models

5.6 Long Period Variables

5.7 Nucleocosmochronology

5.8 The End of the AGB and Post-AGB Evolution

6 White Dwarf Cooling Sequences

6.1 General Properties and Simple Models

6.2 Detailed Models

6.3 Colour–Magnitude Diagrams of WD Populations

6.4 Spectroscopic Classification of WDs

6.5 Low Mass CO- and He-core WDs

7 Resolved Old Stellar Populations in the Galaxy

7.1 Introduction

7.2 Old Star Cluster Distances

7.3 Old Star Cluster Ages

7.4 Ages of Old Field Stars from Photometry/Spectroscopy

7.5 Empirical Rotation/Activity–Age Relationships

7.6 Estimates of the Primordial Helium Abundance

7.7 Estimates of the Primordial Lithium Abundance

7.8 Multiple Stellar Populations in Galactic Globular Clusters

8 Resolved Composite Systems

8.1 Synthetic CMD-fitting

8.2 Multiple Isochrone-fitting

9 Unresolved Old Systems

9.1 Building Blocks of a SPSM

9.2 Low-Resolution Diagnostics

9.3 High-Resolution Diagnostics

9.4 Unresolved Composite Systems

9.5 Validation of Stellar Population Synthesis Techniques

References

Index

Related Titles

Hoard, D. W. (ed.)

White Dwarf Atmospheres and Circumstellar Environments

244 pages with 84 figures, Hardcover

2011

ISBN: 978-3-527-41031-6

 

Barnes, R. (ed.)

Formation and Evolution of Exoplanets

320 pages with 136 figures and 5 tables, Hardcover

2010

ISBN: 978-3-527-40896-2

 

Wang, Y.

Dark Energy

256 pages with 48 figures and 7 tables, Hardcover

2010

ISBN: 978-3-527-40941-9

 

Shore, S. N.

Astrophysical Hydrodynamics

An Introduction

472 pages with 70 figures, Softcover

2007

ISBN: 978-3-527-40669-2

 

Stahler, S. W., Palla, F.

The Formation of Stars

865 pages with 511 figures and 21 tables, Softcover

2004

ISBN: 978-3-527-40559-6

 

Spitzer, L.

Physical Processes in the Interstellar Medium

335 pages with 30 figures, Softcover

1998

ISBN: 978-0-471-29335-4

 

Shapiro, S. L., Teukolsky, S. A.

Black Holes, White Dwarfs and Neutron Stars

The Physics of Compact Objects

645 pages, Hardcover

1983

ISBN: 978-0-471-87317-4

image

The Authors

Prof. Santi Cassisi

INAF

Collurania Astronomical Observatory

Via M. Maggini

64100 Teramo

Italy

 

Prof. Maurizio Salaris

Liverpool John Moores University

12 Quays House

Birkenhead, CH41 1LD

United Kingdom

To my daughter

S. Cassisi

To my mum and the memory of my dad

M. Salaris

Preface

Without a doubt, one of the main problems in modern astrophysics is understanding how galaxies form and evolve. Within the framework of the currently accepted standard cosmological model, it is possible to perform simulations of the formation and evolution of cosmic structures; the results of these simulations are, however, affected by a number of uncertainties, stemming largely from the extreme difficulty of understanding the gas dynamics/processes that determine how baryonic matter settles into galaxies, cools and collapses to form stars, and how stellar winds and supernova explosions stir, mix and chemically enrich the remaining gas, and control the rate at which new stars form. A detailed calculation of these processes is far beyond the capabilities of modern computers.

Fossil evidence available in nearby galaxies – where “fossil” means that the observables are the result of the past star formation history – in the form of their old stellar populations, is an important part of the research on these open issues. The information encoded in the spectra of these stars contain a detailed record of their origin, hence of the mechanisms of galaxy formation and evolution. Moreover, galaxies in the nearby universe can be studied in much greater detail compared to high-redshift counterparts, and provide a picture of the evolution of cosmic structures complementary to that obtained from high-redshift observations.

This book presents and discusses in detail a wide range of techniques developed in the last decades to extract information about formation and evolution of old stellar populations from their photometric and spectroscopic observations. Our definition of old stellar populations comes from the theory of stellar evolution; a stellar system is denoted as “old” if it harbours in the post-Main Sequence stages – excluding remnants like white dwarfs, neutron stars and black holes – stars with mass below ≈ 2.0Mimage (low-mass stars). These old populations are characterized by the signature of a well defined red giant branch in the colour–magnitude-diagram or integrated spectrum. In terms of ages, this implies values above ≈ 1 Gyr, meaning that these stars are the fossil record of the development of galaxies over more than 90% of cosmic evolution. We will use a somewhat arbitrary definition of nearby Universe, in the sense that we discuss methods for both resolved and unresolved populations, and their use to study stellar systems at low redshifts, corresponding to look-back times that are a negligible fraction of the age of the Universe.

This work is the “natural” companion and follow-up of our previous volume, “Evolution of stars and stellar populations”, that was aimed at introducing undergraduate and postgraduate students to modern stellar evolution studies, and the associated techniques to study stellar populations of all ages, including the basics of “stellar population synthesis techniques”. Here, we go several steps further in terms of the level of detail. At the same time, we focus exclusively on old stellar populations. The reader will acquire an in-depth and up-to-date specialistic knowledge of the evolutionary properties of stars hosted by old stellar populations, and how to use their observations to study the evolution of the parent systems. Our book is about “tools and techniques” to study old stellar populations, and the related uncertainties, not about the results in terms of scenarios of galaxy formation; this would require an entire volume on its own. However, we will show some practical applications of the methods, mainly with a view to highlight their reliability.

Chapter 1 briefly introduces the standard Λ Cold Dark Matter cosmology, the problems related to modelling the formation of cosmic structures, and the basic tools for stellar population analyses. Chapter 2 discusses a number of topics related to stellar input physics and model calculations, with particular emphasis on those physical mechanisms that are still uncertain and mostly affect the results presented in the following chapters. The next four chapters present a detailed qualitative and quantitative picture of the life-cycle of low-mass stars, from their formation to the final stage of cooling white dwarfs. Chapters 7–9 present and analyze methods to study simple and composite, resolved and unresolved old stellar populations, based on the stellar evolution previously discussed, providing the reader with the most recent developments in the field. We discuss the recently discovered phenomenon of “multi-populations” in Galactic globular clusters in detail, and its impact on Galactic and extragalactic age-dating techniques. We have also included a large number of references (not necessarily all-encompassing and unavoidably biased by our own familiarity with specific papers) to guide the reader through the recent literature, major milestones and review papers.

We are greatly indebted to Lucio Primo Pacinelli (INAF – Astronomical Observatory of Collurania) and David Hyder (Liverpool John Moores University) for their superb work on many of the figures included in this book, and Katrina Exter for her invaluable help with editing some of the chapters.

Special thanks go to the following colleagues for several figures included in the book as well as for their useful suggestions: F. Allard, H.M. Antia, A. Aparicio, S. Basu, P. Baumann, B. Behr, A. Bellini, D. Bersier, G. Bono, D. Brown, R. Buonanno, E. Carretta, C. Charbonnel, S. Cristallo, E. Dalessandro, P. Eggenberger, J.W. Ferguson, F. Ferraro, G. Fiorentino, C. Gallart, F. Grundahl, S. Hidalgo, A. Irwin, N. Lagarde, A. Marín-Franch, A.F. Marino, G. Michaud, M.M. Miller Bertolami, A.P. Milone, M. Monelli, A. Mucciarelli, S. Percival, L. Piersanti, A.M. Piersimoni, A. Pietrinferni, G. Piotto, J. Richer, L. Sbordone, E. Small, O. Straniero, S. Talon, D. Vandenberg, A. Weiss, P. Wood, and M. Zoccali.

Finally, we wish to warmly thank all of our colleagues and collaborators with whom, over the course of the years, we have investigated and discussed several of the topics addressed in this book.

Teramo

Santi Cassisi

Liverpool

Maurizio Salaris

June 2012

1

Introduction

The so-called Λ Cold Dark Matter (ΛCDM) model is currently the most widely accepted theory for the structure and evolution of the Universe. A combination of experimental data involving the power spectrum of the Cosmic Microwave Background (CMB) radiation temperature fluctuations, the abundance and weak-lensing mass measurements of galaxy clusters, the Type Ia supernovae magnitude-redshift relation and large scale structure observations has placed strong constraints on the free parameters of this cosmological model [1].

In the framework of Λ CDM cosmology, the large scale structure of the Universe is assumed to be homogeneous and isotropic – denoted as “cosmological principle” – on scales of the order of 100 Mpc. Therefore, the local inhomogeneities can be treated as perturbations to the general homogeneity of the Universe, and it is represented by the Friedmann–Robertson–Walker metrics. The evolution with time of this model universe is described by

(1.1) image

(1.2) image

as obtained from the field equations of general relativity. Here, k denotes the spatial curvature, dt the cosmic time separation, ρ the density of matter plus radiation. R(t) is the so-called cosmic scale factor that describes the expansion (or contraction) of the Universe. It has the dimensions of a distance and is dependent on the cosmic time t. The constant k is defined in a way that k = +1 for a positive spatial curvature, k = 0 for a flat space and k = −1 for a negative curvature. The cosmic time is defined by standard clocks comoving with the cosmic fluid. If one sets t to the same value for all clocks when a local property of the cosmic fluid, for example, the average local density of matter, has attained a certain agreed value, by virtue of the cosmological principle, the same value of that property (possibly different from the one at the time of synchronization) has to be measured whenever clocks show the same time.

Equation (1.1) includes the cosmological constant Λ. If Λ were to vary with t, it represents a so-called “dark energy”. Current observations are consistent with Λ being a constant.1) The evolution of R(t) is controlled by ρ, the geometry k and Λ.

By defining the function H(t) as

(1.3) image

one can rewrite Eq. (1.1) as

(1.4) image

The value of H(t) determined today is the Hubble constant H0. It is customary to introduce the critical density ρc ≡ 3 H(t)2/(8πG) and define the density parameter Ωρ = ρ/ρc, an equivalent for the cosmological constant ΩΛ = Λ/(3H(t)2), and the sum Ω = Ωρ + ΩΛ. With these definitions, Eq. (1.4) becomes

(1.5) image

Ω = 1 gives a flat space, Ω > 1 a positive curvature and Ω < 1 a negative curvature.

At present, ρ appears to be dominated by matter, the contribution from photons being negligible (Ωr = (4.800 ± 0.014) × 10−5). Ωρ is split into contributions from baryonic (ΩB = 0.0456 ± 0.0016) and dark matter (ΩDM = 0.227 ± 0.014). The presence of “cold” (with a negligible velocity dispersion) dark matter – called “dark” because it does not interact electromagnetically with the other components of the cosmic fluid – is well-established by observations of galaxy rotation curves, the X-ray emission of the hot ionized gas in clusters of galaxies and gravitational lensing surveys. As for its origin, various candidates predicted by extensions of the Standard Model of particle physics have been proposed, but to date, the question about the nature of dark matter is awaiting a definitive answer.

The dominant contribution to the present energy budget of the Universe is, however, given by Λ, that is, image. This implies that ΩΛ + ΩB + ΩDM = 1 and the space is flat.

As we go backwards in cosmic time, R(t) decreases and the radiation density increases faster than the density of matter. Therefore, there must have been a moment in the cosmic history when the Universe was radiation dominated; the cosmic time tE of matter-radiation equality is of the order of 105 years. Going back to even earlier cosmic times, it is clear that the Universe must have started from a singular state with R = 0 and ρ = ∞ at t = 0. Present estimates of the matter and Λ density coupled to the equations for the evolution of R(t) provide a value of the Hubble constant image km Mpc−1s−1 and an age of the Universe tU = 13.75 ± 0.11 Gyr.

Now, reversing the arrow of cosmic time, as the Universe expands and cools from the initial singularity during the initial radiation dominated era, a small amount of some light elements (mainly D, 3He, 4He, 7Li) is produced when the temperature evolves from T ~ 109 K down to 3 × 107 K. The amount of light nuclei produced depends on the baryon density and the expansion (i.e. cooling) rate. Calculations of cosmological nucleosynthesis – assuming three neutrino species with mass much smaller than 1MeV – provide, for the present estimate of the baryon density, a primordial He mass fraction YP = 0.2487 ± 0.0002, and number abundance ratios (D/H)p = (2.52 ± 0.17) × 10−5, (3He/H)P = (1.03 ± 0.03) × 10−5, image for D, 3He and 7Li, respectively [3, 4].

When the temperature drops below the ionization energy of hydrogen (13.6 eV), the ionization fraction, however, stays close to one due to the large number of photons over baryons (photons dominate by number, although matter dominates energetically and therefore gravitationally). The number of photons in the high energy tail of the black body spectrum is high enough to keep the matter fully ionized. Eventually, the temperature and therefore the number density of sufficiently energetic photons drops so low that recombination prevails. It is at this time, ~ 105.5 years after the singularity (i.e. when T ~ 4000 K), that the first atoms form. The resulting dearth of free electrons has the consequence of reducing the efficiency of electron scattering, so that matter and radiation decouple. From this moment on, the temperatures of radiation and matter become different and start to evolve separately; radiation does not interact any longer with matter and can travel undisturbed through space. The radiation temperature Tr is reduced according to TrR(t)−1, and the blackbody spectrum it had at decoupling is preserved. This blackbody radiation, largely homogeneous and isotropic (because of the cosmological principle) with a temperature Tr of ~ 2.73 K is the theoretical counterpart of the CMB.

If the Universe was perfectly isotropic and homogeneous, no structures would have formed with time. However, in the case of inhomogeneities, regions denser than the background tend to contract and get denser still, inducing a growth of the initial perturbation. In 1970, Peebles, Yu, Sunyaev and Zel’dovich predicted that these inhomogeneities had to be imprinted in the CMB as the tiny temperature fluctuations that have been recently detected (they are of the order of ΔT/ T ≈ 10−5). Fluctuations of the local density of baryonic matter would have behaved as sound waves (with their fundamental mode plus overtones) in the cosmic fluid before recombination, with the photons (to which baryons were tightly coupled before recombination) providing the restoring force. At recombination, the photons started to travel for the first time unimpeded through space; photons released from denser, hotter regions were more energetic than photons released from more rarefied regions. This temperature differences were thus frozen into the CMB at recombination and are detected today. Most importantly, the amplitude and location of the peaks in the power spectrum of these temperature fluctuations are closely related to a number of cosmological parameters (for more details see, for example, the discussion in [5]); in particular, the location of the first peak is mainly related to the geometry of the tridimensional space, whereas the ratio of the heights of the first to second peak is strongly dependent on ΩB. Also, the values of the Hubble constant and of the cosmological constant affect both location and amplitudes of the peaks, albeit with different sensitivities. Observations show that the fluctuations are consistent with a Gaussian random field and the power spectrum is close to scale invariant, that is, image with ns = 0.963 ± 0.012 (ns = 1 for a scale-invariant spectrum).

The widely accepted idea about their origin relates to the so-called inflationary paradigm that can, in principle, explain why Ω = 1 and solve the so-called horizon problem, that is, why the CMB across the sky is to a very good approximation isotropic, even though the size of the region causally connected at decoupling corresponds to about only one degree in the sky today.

The central idea envisages a period in the early Universe where a term Λinf originated by a hypothetical quantum scalar field analogous to the cosmological constant dominates Eq. (1.4). This can be rewritten as

(1.6) image

and its solution, assuming a constant Λinf, is

(1.7) image

if t is much larger than the cosmic time t = tinf of the beginning of the Λinf dominated epoch. Provided this exponential expansion (inflation) is long enough, Ω its driven towards one, irrespective of its initial value. Moreover, during inflation, a very small patch of the Universe can grow to enormous dimensions, so that the isotropy of the CMB temperature we see today, arose from a very small causally connected region that underwent an inflationary growth. An expansion by a factor of ≈ 1030 solves both the flatness and horizon problem without invoking ad hoc initial conditions. The quantum field that originated Λinf is expected to experience quantum fluctuations that were stretched by the inflation to the scales we see imprinted in the CMB. The general belief is that inflation occurred when the strong force separated from the electroweak one, at about t = 10−35 s, and lasted until about t = 10−32 s.

1.1 Galaxy Formation

Starting from the ΛCDM cosmological model, one of the main problems in modern astrophysics research is to understand how a galaxy forms and evolves [6, 7]. Here, we will just summarize the main concepts. The basic idea underpinning the currently most accepted paradigm of galaxy formation is that cosmic structures grow through the mechanism of gravitational instability starting from the pattern of density fluctuations imprinted in the CMB. The dark matter component, having no pressure, begins to collapse well before the baryonic matter, and the primordial density fluctuations will grow. The early evolution of this dissipationless (no energy can be lost through electromagnetic interactions) growth is described by the linear perturbation theory. Once the perturbations become non-linear, their evolution is modelled using N-body simulations. The final result of the non-linear evolution of a dark matter density fluctuation is the formation of a dark matter halo, an approximately stable state supported by the random motions of the dark matter particles. In standard Λ CDM cosmology, the first haloes form from the smallest scale fluctuations, and are followed by successive merging episodes that produce increasingly more massive structures. The baryons are dragged along by the dark matter that dominates gravity, and the haloes are therefore expected to accrete baryons, so that the individual galaxies we see can be interpreted as the product of the evolution of baryonic matter nested inside a much larger halo of CDM. The efficiency of the accretion process will depend on the depth of halo potential well, and on the pressure of the baryons. Therefore, a complete description of the formation and evolution of galaxies requires a detailed description of the evolution of the dissipative baryons, that is, their accretion, heating, cooling, the associated star formation processes, the chemical evolution, the effect of halo merging on the baryonic component.

Overall, at any given cosmic time, matter is distributed over structures spanning many decades in mass, and growth is driven by merging between haloes of similar mass, by accretion of much less massive haloes and diffuse material, and by destruction by infall onto larger haloes (hence, the name of “hierarchical merging” scenario). A crucial point is that galaxy morphology may be a transient phenomenon and that the different types of galaxies we observe nowadays (the Hubble sequence) reflect the variety of accretion histories. As an example, this scenario envisages that baryons falling smoothly into the potential wells of dark matter haloes produce disks, whereas spheroids (bulges of disk galaxies and isolated elliptical galaxies) are the products of major (i.e. between haloes of similar mass) merger events whereby disks are mixed violently on short timescales, and a burst of star formation depletes substantially the gas content of the merging disks. Smooth accretion of intergalactic gas on spheroids appears then to be able to produce a typical present-day disk galaxy, for example, a rotationally supported disk of gas and young stars and a centrally concentrated bulge system of old stars.

One of the major difficulties in modelling from first principles how galaxies form and evolve is related to the physics that drives the evolution of the baryonic component, that is, gas cooling in dark matter haloes, star formation, chemical evolution and feedback mechanisms that either remove cold gas from a disk or suppress gas cooling. These processes, sometimes labelled “gastrophysics”, are still poorly understood in a galactic context and prescriptions that contain free parameters are employed. These parameters are fixed by demanding that models reproduce sets of available observations, typically at low redshifts. A powerful way to provide strong independent constraints on galaxy formation models in general, and on “gastrophysics” in particular, is to determine star formation histories of stellar populations in galaxies using methods from stellar evolution theory.

A wide range of techniques developed in the last decades make use of stellar evolution models to estimate distances, ages, star formation and chemical evolution histories of galaxies. It is these techniques, the related uncertainties and their future developments that we are going to discuss in detail in the following chapters, beginning with their theoretical foundations grounded in stellar evolution theory. We will, at the same time, discuss the use of these methods to determine the star formation and chemical evolution histories of old, local stellar populations.

Although observations at high redshift are certainly a more direct way of looking at the formation of galaxies, these objects are very faint and one can derive a more limited amount of information compared to that obtained from nearby galaxies. Stellar populations in the nearby Universe can be studied in much greater detail and provide a view of galaxy formation and evolution that is complementary to that obtained from high redshift data. Also, observations and analyses of the oldest local stellar populations provide completely independent powerful tests and constraints to the overall cosmological model.

We will use a somewhat arbitrary definition of the nearby Universe, one that often only includes those stellar systems resolvable into individual stars. Here, we will also discuss methods for unresolved stellar populations, and their use to study stellar populations at low redshifts corresponding to look-back times that are a negligible fraction of the age of the Universe.

Our definition of old stellar population comes from stellar evolution theory. We denote as “old” all stellar systems harbouring objects with mass below ≈ 2.0Mimage, that is, those stars that undergo electron degeneracy in their He-core at the end of the main sequence (MS) phase, plus remnants of more massive objects. Observationally, the colour–magnitude-diagram (CMD) of these populations is characterized by the presence of a well-defined red giant branch (RGB) sequence. In terms of ages, our definition of old populations implies values above ≈ 1 Gyr, that is, these stars are the fossil record (fossil in the sense that their observables are the result of the past star formation history) of the development of cosmic structures over more than 90% of cosmic evolution.

1.2 Decoding the Fossil Records: Photometric and Spectroscopic Diagnostics

The amount of evolutionary information we can gather from a stellar population is encoded in the radiation emitted by its stars. A population that is resolved observationally into its individual constituents is denoted as “resolved”, while with an “unresolved” stellar population, we denote a system from which we can only observe the integrated light, that is, the sum of the contribution of all its components.

In the case of resolved populations, one can obtain high-resolution spectra – resolution of the order of ≈ 0.1 Å, most often in the visible wavelength range – of individual stars, that allow the determination of photospheric abundances of several chemical elements using model atmosphere calculations. These abundances provide constraints on the initial chemical composition of the star. Photometric observations through broadband filters (i.e. Johnson–Cousins, Sloan, WFC3, 2MASS) with passbands of order ≈ 103 Å, typically spanning the wavelength range from near-ultraviolet (near-UV) to near-infrared (near-IR), are used to produce CMDs. These diagrams are the main tool to constrain ages, star formation and chemical evolution histories – this latter in conjunction with estimates from spectroscopy, if/when available – of resolved stellar populations. Intermediate band photometric filters (i.e. Strömgren, DDO, Walraven) with passbands of order ≈ 102 Å have been devised to derive indices that are particularly sensitive to some specific stellar property, like surface gravity or metallicity (e.g. the indices c1 and m1 in the Strömgren system). Narrow band systems – passbands of order ≈ 10 Å – also exist (i.e. Oke, Wing). For example, individual bands in the Wing systems are sensitive to CaH, CN, TiO or VO.2)

As for unresolved stellar populations, integrated photometry typically in broadband photometric systems provides colour–colour diagrams or – when measurements in several bands are available – a very low-resolution spectral energy distribution (SED) that can be used to constrain ages and metal content of the parent populations. Integrated spectra with resolution of ≈ 1–10 Å can be used to infer ages and, in principle, detailed individual abundances of several chemical species.

Figure 1.1 displays some examples of photometric and spectroscopic data for resolved and unresolved populations.

Figure 1.1 (a) A CMD of the Galactic globular cluster 47Tuc [10]. (b) Integrated UBV RI JHK magnitudes of the elliptical galaxy NGC584 [11]. (c) Portion of the spectrum of the star HD9562 [12]. (d) Integrated spectrum of the dwarf elliptical galaxy M32 [13]. The fluxes are normalized to the observed flux at 4000 Å.

image

1.3 Decoding the Fossil Record: the Tools

Observed CMDs of resolved populations, SED, colour–colour diagrams, integrated spectra of unresolved populations can be interpreted in terms of evolutionary properties of the parent stars by applying results from stellar evolution theory.

1.3.1 Theoretical Stellar Models, Tracks, Isochrones

The solution of the equations of stellar structure provides a stellar evolution model, that is, the run of physical and chemical quantities from the centre to the photosphere of a star of given initial mass and initial chemical composition, and their evolution with time. It is common practice in stellar astrophysics to specify the initial chemical composition of stellar models by means of the symbols X, Y and Z. They denote the mass fractions of hydrogen, helium and all other elements heavier than helium (called metals, hence Z is also called the metallicity of the star) respectively; these three parameters are related through the normalization X + Y + Z = 1. For the metals, the distribution of the individual fractional abundances has to be specified.

This is a convenient choice from the theoretical point of view, though it is not directly related to what is determined from stellar spectroscopy. The helium abundance, for example, cannot be determined for all stars since low-mass objects are generally too cool to show helium spectral lines, and the metal abundances are usually determined differentially with respect to the Sun. The traditional metal abundance indicator is the quantity [Fe/H] ≡ log [N(Fe)/N(H)]* – log [N(Fe)/N(H)]image, that is, the difference of the logarithm of the Fe/H number abundance ratios observed in the atmosphere of the target star and in the solar one. If one assumes that the solar heavy element distribution is universal, the conversion from Z to [Fe/H] is given by

(1.8) image

If one relaxes the assumption of a universal scaled-solar heavy element distribution, the correspondence between [Fe/H] and X, Y, Z obviously changes because the ratio between the iron abundance and Z is different than in the Sun. In this case, one can still use Eq. (1.8), provided that the left-hand side denotes the ratio of the “total” abundance of metals to hydrogen

(1.9) image

A basic working tool in stellar evolution studies is the Hertzsprung–Russell diagram (HRD), a plot of a star bolometric luminosity versus its effective temperature – both outcomes of stellar evolution calculations – that will be widely used in the next chapters. The evolution of a stellar model in the HRD is denoted as the “stellar track”.

Figure 1.2 Theoretical HRD for the evolution of the Sun. All major evolutionary phases are labelled (see text for details).

image

Figure 1.2 displays the complete theoretical HRD, from birth to the final evolutionary stage, for the Sun. All main evolutionary stages of low-mass stars discussed in the next chapters are covered in this diagram. The starting point of the evolution shown in Figure 1.2 is the pre-main sequence (PMS) phase, where the star is convective and evolves vertically in the HRD along its so-called Hayashi track driven by gravitational contraction according to the virial theorem, until a radiative core forms and eventually H-burning ignites in the central regions. At this stage, the evolution slows down, while H is converted into He in the core, during what is denoted as the main sequence (MS) phase. It is the longest evolutionary phase and as a consequence, the number of H-burning stars which can be observed in a given population is much larger than the number of stars in any other phase. When the central H is exhausted, the H-burning shifts to a narrow shell around the pure He-core left over by the central H-burning, and the subgiant branch phase (SGB) starts. During the SGB evolution, the He-core undergoes electron degeneracy, the external layers expand and the photosphere gets cooler, until the model – by now with a very deep convective envelope – reaches its red giant branch (RGB) location, an approximately vertical sequence in the HRD. During SGB and RGB evolution, the He-core grows steadily in mass due to fresh He deposited by the outward moving (in terms of mass layers) H-burning shell, and correspondingly, the surface luminosity increases. Along the RGB, mass loss processes are also efficient, and reduce the mass of the convective envelope.

When the electron degenerate core reaches a mass of about 0.47Mimage, He-burning ignites in the core (He flash), the electron degeneracy is lifted and the star begins a phase of quiescent central He-burning (labelled as He-burn in Figure 1.2) at a much lower luminosity compared to the end termination of the RGB. The precise value of Teff is determined by the amount of mass left in the envelope, Teff getting higher with decreasing envelope mass (more efficient RGB mass loss). In metal-poor systems like Galactic globular clusters (GCs), the central He-burning stage appears as an extended, more or less horizontal sequence in the CMD at visible wavelengths, and it’s called the Horizontal Branch (HB). During this central He-burning phase, the H-burning shell is still active, and when He is exhausted in the core, shell burning provides the energy necessary to maintain hydrostatic equilibrium.

The model now has a C and O core left over by the central He-burning that becomes electron degenerate, marking the beginning of the asymptotic giant branch (AGB) phase. During the AGB evolution, H and He shell burning are never active at the same time, instead they take turns at producing the nuclear energy necessary to maintain hydrostatic equilibrium through a series of so-called thermal pulses (TPs). The AGB sequence practically overlaps with the RGB in the HRD, and the evolution is towards increasing luminosity because of the mass increase of the CO-core due to the He-burning shell. Mass loss is again very effective in reducing the envelope mass. When the envelope mass (reduced by both the growth of the CO-core and mass loss from the photosphere) is reduced to ≈ 0.01Mimage, the model starts its post AGB (PAGB) phase, moving towards higher Teff at constant luminosity, fixed by the value of the CO-core mass, equal to about 0.54Mimage, with some residual shell burning.

The PAGB phase terminates when the model reaches its white dwarf (WD) cooling sequence. From this point on, the evolution is towards lower surface luminosities (and Teff), the energy radiated being the free energy of the non-degenerate ions, while hydrostatic equilibrium is maintained by the pressure of the degenerate electrons.

Stellar tracks of different masses and chemical compositions can be combined to predict the HRD of stars harboured by stellar populations with a generic star formation and chemical evolution history. The most elementary stellar population, usually denoted as simple stellar population (SSP), is made of objects born at the same time in a burst of star formation activity of negligible duration, with the same initial chemical composition. Any population with an arbitrarily complex evolutionary history can be reduced to a linear combination of several SSPs.

The theoretical HRD of a SSP is called isochrone. Consider a set of evolutionary tracks of stars with the same initial chemical composition and various initial masses; different points along an individual track correspond to different values of the time ttr and the same initial mass. An isochrone of age t is the line in the HRD that connects the points belonging to the various tracks (one point per track) where ttr = t. This means that when we move along an isochrone, time is constant. However, the value of the initial mass of the star at each point is changing, that is, increasing towards more advanced evolutionary stages. A generic point along an isochrone of age t is therefore determined by three quantities: bolometric luminosity, effective temperature and value of the evolving mass. From these three parameters, one can easily derive additional quantities like the radius and surface gravity. It is also straightforward to associate to each point along an isochrone the expected surface chemical abundances – taken from the underlying grid of stellar evolution models – to be compared with spectroscopic measurements.

Figure 1.3 Isochrones for 5 and 13 Gyr and solar initial chemical composition (thick solid lines) overimposed on a set of stellar tracks for the same initial chemical composition (thin-dashed lines). We display in order of increasing luminosities along the MS, tracks for 0.6, 0.8, 0.9, 1.0, 1.1, 1.2 and 1.4Mimage.

image

If mass loss processes are included in the individual stellar tracks – as we have briefly seen before, mass loss is very efficient during the RGB and AGB evolution of low-mass stars – the situation is only slightly more complicated because along each track the total mass is changing with time. The procedure to compute the isochrones is the same, that is, one connects the points of equal age along tracks with various initial masses. However, the value of the mass evolving at a given point along the isochrone is now smaller than the initial mass of the parent track.

Figure 1.3 shows, as an example, two isochrones for ages equal to 5 and 13 Gyr, respectively, and the solar initial chemical composition, that cover the evolutionary phases from the MS to the He flash. Notice the large mass range spanned by the MS phase, whereas the SGB and RGB stages (this is true for all post-MS phases but the final WD stage) are populated by objects with almost the same mass, equal to the value at the termination of the MS (the so-called turn off point – TO). This is a consequence of their much shorter evolutionary timescales, compared to MS lifetimes.

Figure 1.4 Isochrones for 2, 4, 6, 8, 10 and 12 Gyr (solid lines) overimposed on a set of WD cooling tracks with masses between 0.54 and 1.0Mimage (dashed lines).

image

Representative isochrones for the final WD stages are shown in Figure 1.4. Along a WD isochrone, it is the sum of the WD cooling age plus the progenitor age at the end of the TP phase that stays constant. Obviously, a fundamental ingredient to calculate WD isochrones is the relationship between the mass of the progenitor on the MS (initial mass) and its final WD mass (initial-final mass relation – IFMR – see Chapter 6 for details). Due to the fast cooling times at high Teff, the bright section of a WD isochrone of fixed age is populated by objects with approximately the same mass, corresponding approximately to the final WD mass of stars that have just left the TP phase. One can also notice that all isochrones in Figure 1.4 share the same location at bright luminosities because empirical and theoretical IFMRs predict essentially the same WD mass (~ 0.55Mimage) for all low-mass stars. At the bottom end of the sequence, one recovers the progeny of more massive intermediate-mass stars that have left the TP phase earlier. This explains the sudden increase of Teff (lower radii) since the more massive progenitors produce WDs of increasing mass, and hence lower radius.

Once a generic isochrone of a given age and initial chemical composition is computed, it can be transferred to an observational CMD, that is, the plot of a star magnitude in a given photometric band versus a colour index. To predict magnitudes in a generic photometric system, one needs to assign a spectrum to each point along the isochrone, making use of model atmosphere calculations.

A model atmosphere describes the physical and chemical structure of a stellar atmosphere and the transfer of radiation from the photosphere into interstellar space. It is defined (at least in the plane-parallel approximation) by three quantities: the value of Teff, the surface gravity g and the photospheric chemical composition.

Interpolation among a suitable grid of spectra obtained from model atmosphere calculations provides spectra to assign to each point along the isochrone.

From a given spectrum, it is straightforward to compute bolometric corrections BCA to a generic filter A from the following equation:3)

(1.10) image

Here, Mbol,image is the bolometric magnitude of the Sun, Fλ is the model spectrum (i.e. from model atmosphere calculations), Sλ is the response function of the photometric filter (a measure of the efficiency of photon detection within the filter wavelength range) that covers the wavelength range between λ1 and λ2, image denotes the reference spectrum that produces a known apparent magnitude image (for example, the spectrum of Vega, or a spectrum of constant flux density per unit frequency for the ABmag system. In both cases, the apparent magnitude image is usually set to zero). From BCA, one can then obtain the magnitude MA from

(1.11) image

where image. The value of Limage is known, and once the bolometric magnitude of the Sun is fixed (e.g. Mbol,image = 4.74 [14]), what is left in order to convert Teff and the bolometric luminosity L into magnitudes is the choice of image and image, the so-called “zero points” of the photometric system.

The very popular Johnson–Cousins and the Hubble Space Telescope HST/ WFPC2 VEGAmag systems, for example, make use of the star Vega to fix the zero points, assuming that the apparent V magnitude of Vega and all its colour indices are equal to zero. The Sloan system instead uses a zero point flux image = 3.63110−1 ergs−1 cm−2 Hz−1 that gives image at all frequencies (see [15] for details about zero points of several photometric systems). Figure 1.5 displays the response function of a few widely used filters;4) a theoretical low-resolution spectrum of Vega is overimposed, for the sake of comparison.

The transformation from theoretical luminosity and Teff to observed magnitudes and colour indices has been described assuming up to now that Fλ at the stellar surface is obtained from the appropriate theoretical model atmospheres. It is, however, known that current theoretical model atmospheres suffer from at least two main shortcomings:

Figure 1.5 Response functions of the uv by Strömgren photometric filters (a), Johnson–Cousins UBVRI and 2MASS JHKs (b). Overimposed is a theoretical spectrum of the star Vega (dashed line) [16].

image

These shortcomings cause an uncertainty of possibly a few hundredths of a magnitude on the BCA values. An alternative solution is to use empirical spectra of a sample of nearby stars with independently determined Teff, gravities and chemical composition. A problem with this approach is that stars for which empirical Teff values can be determined are local objects that cover a fairly narrow combined range of chemical compositions, masses and evolutionary phases (reflecting the local population of the Galactic disk) and would not allow a complete modelling of different stellar populations.

1.3.2 Luminosity Functions, Synthetic CMDs

The CMD of an isochrone only provides a morphological counterpart to the observed CMDs of resolved stellar populations, for it does not give any information about the number of objects expected along the different branches of the observational diagrams. As we will see in the next chapters, just using isochrones is enough to constrain ages and initial chemical compositions of resolved SSPs. However, additional information coming from star counts must be accounted for when trying to disentangle the star formation and chemical evolution history of more complex stellar populations.

Given that each point along an isochrone corresponds to a specific evolutionary stage of a model of initial mass M, assuming a stellar initial mass function (IMF) that provides the number of stars dN born with mass between M and M + dM, will also provide the number of objects populating a generic interval between two consecutive points along the isochrone. For an IMF of the standard form dN = CMx dM, the normalization constant C can be fixed by specifying either the total mass (Mt) or the total number (Nt) of stars born when the SSP formed. When x ≠ 2, the value of the constant C is given by

(1.12) image

where x is the exponent of the IMF, Mu and Ml are, respectively, the upper and lower mass limits of the stellar entire mass spectrum, for example, ~ 0.1 and ~ 100Mimage. If x = 2,

(1.13) image

This normalization guarantees that the total mass of stars formed stays constant, independent of the value of x, but the initial number of stars formed changes with changing value of the slope of the IMF. In the case Nt instead of Mt is given, the previous relationships have to be rewritten as

(1.14) image

if x ≠ 1 and

(1.15) image

if x = 1.

One way to compare observed star number counts with theory is to use luminosity functions (LFs). One can make use of two types of LFs. The differential LF, for example, the run of the observed star number counts image in the magnitude range between image and image as a function of image, or the cumulative LF, that is, the run of the number of stars N(MA > image) (or N(MA < image)) with magnitude MA larger than image (or lower than image), as a function of image itself.

Figure 1.6 Differential (a,c) and cumulative (b,d) LFs for the isochrones displayed in Figure 1.3, computed with a Salpeter IMF and a total population of 106 stars.

image

Figure 1.6 displays both the differential and cumulative LF for the two isochrones displayed in Figure 1.3. The normalization constant has been chosen in order to have 106 objects along each isochrone. The exponent is x = –2.35, the so-called Salpeter IMF [17].

Finally, a synthetic CMD combines both isochrone and the predicted star counts (for a given IMF). Synthetic CMDs can be easily calculated by iterating the following procedure. For a fixed age and initial chemical composition of a SSP (the generalization to complex star formation histories is straightforward), the starting point is to randomly draw a value of the mass M for the synthetic star according to a specified IMF. With this value of M, one interpolates along the appropriate isochrone to determine the magnitude and colour of this synthetic object. Inclusion of photometric errors and unresolved binaries is also straightforward. The magnitude and colour obtained from the isochrone interpolation are modified by adding a value of the photometric error, drawn from a Gaussian (for example) random distribution with σ specified by the data reduction process. A given number of unresolved (non-interacting) binaries or blends of single stars can be included in a similar fashion. To these selected stars (whose mass is denoted as Mpr), it is assigned a companion (with the same age and metallicity of Mpr) whose mass Mcomp is drawn according to

(1.16) image