ADVERTISEMENT

Measuring Distance In Space Using The Dark Energy Survey Portal

Distance measurement is an important challenge in astronomy. Between the parallax of nearby stars and the redshift of the most distant galaxies, there are a lot of steps. With the Dark Energy rush, several large astronomical surveys are trying to understand the accelerated expansion of the universe. The Dark Energy Survey (DES) has just completed its observations last January, cataloging hundreds of millions of galaxies.

Obtaining precise distance measurements of such a large number of galaxies is a hard task. To address this problem, photometric redshifts (photo-z’s) have been largely used by the astronomical community. Although less accurate than spectroscopic ones, they are cheaper and faster (regarding the number of galaxies measured per exposure time) and, hence, allow observations to go beyond signal-to-noise ratio limits of spectroscopic observations.

ADVERTISEMENT

The complete procedure to estimate photo-z’s involves many steps. First, it requires the handling of a multitude of spectroscopic datasets for calibration, matching between the sources on these spectroscopic datasets and the ones at the photometric survey, training of empirical algorithms (e.g.: neural networks and random forest), validation of these algorithms (e.g.: bias and variance), and, finally, measuring photo-z’s for large samples.

In this work, we address how the DES Science Portal infrastructure presents itself as a solution to connect all these steps in a stable environment, ensuring consistency and provenance control. The Portal is a web-based tool that combines a web application, a workflow system, a cluster of computers, and two databases. It is developed collaboratively using GIT by a large number of IT and science people geographically spread across Brazil. We also have contributions from DES members in several other countries among DES participant institutions.

The chain of tasks mentioned above is executed in the Portal as a modular workflow where intermediate products are created and consumed by the next phase, as shown in the schema (Figure 1). The data, represented by yellow cylinders, flow through pipelines (in green), which are self-consistent blocks that can be composed of one or more independent components (white boxes). For data-intensive tasks, like the computing of photo-z’s for large samples, these components run in parallel in a cluster of computers (components delimited by the dashed line).

Image republished with permission from Elsevier from https://doi.org/10.1016/j.ascom.2018.08.008

We use data from DES Y1 internal release for illustrative examples of running the chain of pipelines related to photo-zs in the Portal. These pipelines are part of a larger group that is used to create science-ready catalogs (Figure 2), and connect directly to Scientific Workflows (see Fausti et al. 2018 for further details). We explore different configurations of parallelization using one photo-z algorithm as an example. We study how the duration of the execution depends on the size of the data chunk analyzed by computer node, forecasting the optimal choices for running on future DES data releases.

ADVERTISEMENT
Image republished with permission from Elsevier from https://doi.org/10.1016/j.ascom.2018.08.008

The Portal is a useful platform for long-term projects involving a large number of collaborators and requiring the analysis of large amounts of data, keeping critical information (input data, code version, configuration parameters, output files, and results) for processes executed within its framework that can be recovered at any future time. It is a powerful tool in the era of projects such as LSST, among others.

These findings are described in the article entitled DES science portal: Computing photometric redshifts, recently published in the journal Astronomy and Computing.

Comments

READ THIS NEXT

Classifying Sesame Oil Seed Varieties And Origins Using Mass Spectrometry

The quality and authenticity of vegetable oils are of importance not only for their nutritional value but also for their […]

Exploring Fast And Low-Cost Strategies In Drug Discovery For Chagas Disease: The Alternative Quest For New Effective Treatments

From the south of United States to the south of Argentina there is an endemic illness caused by a parasite […]

Explaining Why Concussions May Activate A Pituitary “Dimmer Switch”

For a number of years, researchers have described endocrine (glandular) problems in some people with a history of concussion. These […]

Some Hints On How To Write And Publish A Good Medical Paper

One of the most noticeable shortcomings of the research career is the general lack of specific training on how to […]

Detecting Ultra-Small Plastic Micro-Particles In Water

Small particulates that exist in the environment are produced either naturally within groundwater or through anthropogenic activities. Ultra-small particles in […]

Injection Of Nanofiber “Peanuts” For Hemostasis

Noncompressible torso hemorrhage (NCTH) is a significant cause of mortality in both civilian and military settings. NCTH is a high-grade […]

New Building Blocks For Drug Discovery Are Getting Closer: Gem-difluorocyclopropane-derived Amines

Modern drug discovery relies heavily on the ability of chemists to produce good starting points for producing high-quality lead compounds. […]

Science Trends is a popular source of science news and education around the world. We cover everything from solar power cell technology to climate change to cancer research. We help hundreds of thousands of people every month learn about the world we live in and the latest scientific breakthroughs. Want to know more?