The following article is Open access

Richardson–Lucy Deconvolution with a Spatially Variant Point-spread Function of Chandra: Supernova Remnant Cassiopeia A as an Example

, , , , , and

Published 2023 July 4 © 2023. The Author(s). Published by the American Astronomical Society.
, , Citation Yusuke Sakai et al 2023 ApJ 951 59 DOI 10.3847/1538-4357/acd9b3

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/951/1/59

Abstract

Richardson–Lucy (RL) deconvolution is one of the classical methods widely used in X-ray astronomy and other areas. Amid recent progress in image processing, RL deconvolution still leaves much room for improvement under realistic situations. One direction is to include the positional dependence of a point-spread function (PSF), so-called RL deconvolution with a spatially variant PSF (RLsv). Another is the method of estimating a reliable number of iterations and their associated uncertainties. We developed a practical method that incorporates the RLsv algorithm and the estimation of uncertainties. As a typical example of bright and high-resolution images, the Chandra X-ray image of the supernova remnant Cassiopeia A was used in this paper. RLsv deconvolution enables us to uncover the smeared features in the forward/backward shocks and jet-like structures. We constructed a method to predict the appropriate number of iterations using statistical fluctuation of the observed images. Furthermore, the uncertainties were estimated by error propagation from the last iteration, which was phenomenologically tested with the observed data. Thus, our method is a practically efficient framework to evaluate the time evolution of the remnants and their fine structures embedded in high-resolution X-ray images.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Imaging analysis is critically important for studying diffuse celestial sources. X-ray astronomy, starting with the first space application of X-ray CCD in ASCA (Burke et al. 1994), has delivered detailed images of various celestial objects; e.g., supernova remnants such as SN1006 (Bamba et al. 2003) and Cassiopeia A (hereafter Cas A; Hwang et al. 2004), and galaxy clusters such as A2142 (Markevitch et al. 2000). Since the X-ray mirrors used in Chandra are the largest and most precisely built, exceeding the angular resolution of Chandra is considered to be challenging. Therefore, enhancing the technique of imaging analysis has been an essential direction to utilize the highest spatial resolution and data accumulated over decades.

X-rays are collected primarily by total reflection from the surface of an X-ray mirror, therefore the response function for the distribution of focused X-rays, called the point-spread function (PSF), is nearly energy independent. On the condition that a PSF is independent of incoming photon energy and the position of the focal plane, a reverse calculation of a convolution of PSF, so-called image deconvolution (see the review on deconvolution in astronomy by Starck et al. 2002), is highly simplified. There are various deconvolution methods proposed by assuming that a PSF is constant during the deconvolution process, such as the deconvolution of Suzaku XIS (Sugizaki et al. 2009). One of the latest examples is the image restoration algorithm Expectation via Markov chain Monte Carlo (Esch et al. 2004). It is applied to the double active galactic nuclei in NGC 6240 (e.g., Fabbiano et al. 2020; Paggi et al. 2022), succeeding in finely resolving the two cores. Similarly, a classical method, Richardson–Lucy (RL) deconvolution proposed by Richardson (1972) and Lucy (1974), is often used (e.g., Grefenstette et al. 2015; Thimmappa et al. 2020; Sobolenko et al. 2022).

The choice of method depends on the trade-off between accuracy and computational cost. Relaxing the condition that a PSF is positional-independent and/or energy-independent, the deconvolution methods increase the complexity of the calculation. RL deconvolution is one of the simplified methods but still has room for improvement in practical situations. In gamma-ray astronomy, a PSF can change by one order of magnitude with energy and incident angle; it is calculated for each event, e.g., RL algorithm optimized for Fermi-LAT and EGRET (Tajima et al. 2007). In contrast, as the number of photons is much larger in X-ray astronomy, event-by-event reconstruction is less practical; image-based reconstruction thus can be the first choice in X-rays. However, there are few studies on extending the RL method, especially their application to diffuse sources obtained by Chandra. We therefore explored its applicability to the Chandra data and considered the associated systematic errors.

In this paper, we implement RL deconvolution with a spatially variant PSF (RLsv) algorithm, assuming it to be used for Chandra images. Section 2 describes the principle of the RLsv method. One of the technical difficulties is reducing computational cost in calculating PSFs. This is solved by decimating the sampling interval of PSFs, while a side-effect is discussed in Section 5.1. Section 3 presents an example of its application to a diffuse source observed by Chandra. We apply the RLsv method to the supernova remnant Cas A as an example, because Cas A is bright and extended over the entire field of view of the ACIS detector, which would be the best target for the first application. The remnant is intensively studied because of its unique structure and evolution, e.g., the velocities and thickness of shocked filaments (Patnaude & Fesen 2009; Sato et al. 2018; Tsuchioka et al. 2022), where the method can contribute to advancing our understanding of the phenomena. In Section 4, we propose a reliable number of stop iterations and uncertainties of the method. We develop the method to estimate the number of convergent iterations by generating fluctuations due to statistical errors during iteration. Furthermore, the uncertainty on the RLsv-deconvolved image is estimated using the law of error propagation (e.g., Ku 1966). As a result, filaments and ambiguous structures of Cas A are deconvolved to be sharper with some knowledge of the statistical uncertainties.

2. Method

2.1. RL Deconvolution

The RL algorithm iteratively estimates a true image from an observed image using Bayesian inference. It generally assumes that the PSF does not change with position in the image. The RL algorithm is expressed by

Equation (1)

where i and j are mapping the image in the sky, and k is mapping the image on the detector. The indices of the summation run through all the pixels. W(r) is the restored image after r iterations, and H is the observed image on the ACIS detector. Pjk is the probability that a photon emitted in sky W bin j is measured in data space H bin k, or P(Hk Wj ).

2.2. RL with a Spatially Variant PSF

Previous Chandra image deconvolution approaches (e.g., Thimmappa et al. 2020; Sobolenko et al. 2022) used a simplified approximation for the Pjk values, i.e., they used the same PSF for each j bin. Here we assume that the PSF changes as a function of the off-axis angle and the roll angle. As a consequence, the Chandra RL algorithm is extended. The formula for RLsv is obtained by rewriting Equation (1) as

Equation (2)

Pjjk refers to a PSF at a position j (first index) which returns a probability that an event emitted at Wj (second index) is observed at Hk (third index), or Pj (Hk Wj ). Computational cost and memory requirements need to be minimized for calculating the third-order tensor of the PSF, which is a distinctive feature of the RLsv algorithm. When H is corrected for slight differences among pixels in effective area and exposures, its normalization can be chosen arbitrarily. Here we use Hk = Nk /Ak , where Nk is the detector count image, and Ak is the Hadamard product of effective area and exposure time.

2.3. RLsv Deconvolution with Total Variation Regularization

There are regularization techniques to enhance the RL method, which are also readily available for the RLsv method. Among these, total variation (TV) regularization (Rudin et al. 1992) is effective in handling statistical errors, which is used in the RL method (Dey et al. 2006). The formula for RLsv with the regularization is obtained by rewriting Equation (2) as

Equation (3)

The only difference from the RLsv algorithm of Equation (2) is the regularization term of $1-{\lambda }_{\mathrm{TV}}{\rm{div}}({\rm{\nabla }}{W}_{i}^{(r)}/| {\rm{\nabla }}{W}_{i}^{(r)}| )$, where λTV is the regularization parameter, ${\rm{div}}(\cdot )$ is the divergence, and ${\rm{\nabla }}{W}_{i}^{(r)}$ is the gradient of ${W}_{i}^{(r)}$. In this paper, we utilize the parameter λTV = 0.002, as proposed by Dey et al. (2006).

2.4. Comparison to Other Methods

Deconvolution methods require an understanding of their applicability to a practical condition, as well as optimization of computation cost and accuracy (for features of various methods see Naik & Sahu 2013). The RL method is well studied and has been used to incorporate regularization (e.g., van Kempen & van Vliet 2000; Dey et al. 2006; Yuan et al. 2008; Yongpan et al. 2010) and the recent trend of deep learning (e.g., Agarwal et al. 2020). For Chandra users, RL deconvolution with a single PSF is frequently used because the method is already implemented as arestore in the Chandra Interactive Analysis of Observations (CIAO; Fruscione et al. 2006), Chandra's standard data processing package.

Compared to other methods, the RL method forces the deconvolved image of each iteration to be non-negative, and its integral value is conserved. Additionally, the method converges to the maximum likelihood solution for a Poisson noise distribution (Shepp & Vardi 1982), which is suitable for Chandra images with noise from counting statistics. Depending on the application, it is less prone to ringing artifacts than inverse PSF-based methods (e.g., Sekko et al. 1999; Neelamani et al. 2004); see the results of the comparison by Dalitz et al. (2015). According to White (1994), it is robust against small errors in the PSF.

3. Application to Observed Data

3.1. Data Selection

Because Cas A is a bright and diffuse X-ray source with a moderately large apparent diameter, it is an ideal target to demonstrate the RLsv method. It has been observed by Chandra almost every year since 1999. The Chandra data of Cas A used in this paper are listed in Table 1: ACIS-S observation of 2004 in Obs. ID = 4636, 4637, 4639, and 5319. The image size is 1489 × 1488 pixels, or 743'' × 742'' given a unit pixel of 0farcs492. Data processing and analysis were performed using CIAO version 4.13. The data were reprocessed from the level 1 event files by chandra_repro. Since the roll angle and optical axis of the four observations are almost the same (the maximum difference of the optical axis location is about 4 unit pixels), all the events were merged into one by merge_obs. The total exposure time was 428.29 ks.

Table 1. Basic Information on the Chandra Observations of Cas A Used in this Paper

Obs. IDObs. StartExp. TimeDetectorR.A.Decl.Roll
 yyyy mmm dd(ks) (deg)(deg)(deg)
46362004 Apr 20143.48ACIS-S350.912958.841249.7698
46372004 Apr 22163.50ACIS-S350.913158.841449.7665
46392004 Apr 2579.05ACIS-S350.913258.841549.7666
53192004 Apr 1842.26ACIS-S350.912758.841149.7698
51962004 Feb 849.53ACIS-S350.912958.7933325.5035

Download table as:  ASCIITypeset image

3.2. Generating the PSF of Chandra

The Chandra telescope system consists of four pairs of nested reflecting surfaces, configured in the Wolter type I geometry. The high-energy response is achieved by coating the mirrors with iridium. It has attained the highest angular resolution of 0farcs492 among existing X-ray telescopes. Its mirror has been extensively calibrated on the ground and in orbit (Jerius et al. 2000). The Chandra PSF is positional-dependent, mainly due to aberrations. The RLsv method includes the position dependence of the PSF, which is useful for highly extended X-ray sources.

Because creating a PSF for each position is computationally expensive, it is decimated at some intervals. For reference, creating a PSF takes several seconds, depending on the computational environment and desired accuracy. The sampling interval of the PSFs was chosen to be 35 × 35 pixels (total of 43 × 43 = 1849). The interval was determined empirically by trying several different ranges. In general, the PSFs simulated from each observation should be merged for a precise calculation. Here, the PSF of the Obs. ID of 4636 was used as a representative since its sampling is decimated. The PSFs at the lattice points were generated by CIAO's simulate_psf using the Model of AXAF Response to X-rays (Wise et al. 1997; Davis et al. 2012) at a monochromatic energy of 2.3 keV. They were applied to the observed image with energies from 0.5 to 7.0 keV.

Figure 1 shows all the PSFs sampled every 35 × 35 pixels. The optical axis is located at the northeast in the image, where the spread of the PSF is minimum. As the position is away from the optical axis, the tail of the PSF increases with its gradual shift of the elliptical axis. Although it is a trade-off with photon statistics, it is effective to run the RLsv method with the optimal monoenergetic PSF for each of the multiple energy decompositions (see Figure 2). This is because, at shorter wavelengths, the effect of diffuse reflection due to the roughness of the mirror surface is not negligible (Jerius et al. 2000).

Figure 1.

Figure 1. Cas A image (Obs. ID = 4636) and the two-dimensional probabilities of the point-spread functions (PSFs). The integral of each PSF is normalized to be 1. The PSF color scale is a fixed range. The location of the optical axis is indicated with a green cross.

Standard image High-resolution image

3.3. Results of the RLsv Method

Figure 3(a) is an observed ∼400 ks image using the energy range from 0.5 to 7.0 keV, as explained in Section 3.1. We applied the RLsv method to the image with a sampling interval of each PSF of 35 × 35 pixels. The number of iterations is 200. Note that the choice of iteration number is discussed in Section 4.1. The result of the RLsv method is presented in Figure 3(b). The unit of flux and its range in Figure 3(b) is the same as in Figure 3(a). The overall structures in the RLsv image become more vivid than the original ones. The images around the off-axis are significantly improved compared to those around the optical axis.

Figure 2.

Figure 2. (a) X-ray RGB (red: 0.2–1.2 keV, green: 1.2–2.0 keV and blue: 2.0–7.0 keV) band images of Cas A obtained with Chandra. (a-1, -2) Enlarged images specified by the colored frames in (a). (b) Same as (a) except for RLsv-deconvolved in each energy band. The unit of flux in the images is photons cm−2 s−1.

Standard image High-resolution image
Figure 3.

Figure 3. (a) X-ray image in the 0.5–7.0 keV band of Cas A obtained with Chandra. (a-1, -2, -3) Enlarged images specified by the colored frames in (a). (b) Same as (a), but for the RLsv-deconvolved results. The unit of flux in the images is photons cm−2 s−1.

Standard image High-resolution image

To make the differences more precise, we present magnified images of the original image in Figures 3(a-1, -2, -3) and the RLsv-image in Figures 3(b-1, -2, -3). The three regions represent a sharp filament in the northeast, complicated filaments in the north, and a slightly diffuse area in the south. The filamentary structures in the northeast and north become sharper in the RLsv-image. We will quantify the filament width in detail and discuss the systematic uncertainties associated with the method in Section 4.2.

4. Uncertainty Estimation

4.1. Assessment of the Reasonable Number of Iterations

We considered a way of assessing an appropriate number of iterations, which is one of the issues with the RL method. This is because the method has the property of excessive amplification of noise as the number of iterations increases. We propose a method to suppress convergence using statistical errors during the iteration. The formula of the method is written as

Equation (4)

The only difference from the RLsv algorithm, Equation (2), is the G(Nk )/Ak term. N (counts) is the map of detector counts. A (photons cm−2 s−1) is the Hadamard product of effective area and exposure time. G(Nk ) is a random number generator following a Poisson distribution with a count in the kth pixel of Nk ; i.e., G(Nk )/Ak is a flux in units of photons cm−2 s−1. The reason for normalizing G(N) by dividing it with A is to account for the slight variations in effective area and exposure time among pixels.

The performance of the RLsv algorithm using Equation (4) is compared to that using Equation (2). The convergence is evaluated using the mean squared error (MSE) of the two images: one step before and after iterations. Figure 4 shows the history of the MSE during the iteration. The curve obtained by the RLsv algorithm, Equation (4), saturates at a certain level, while the other continues to decrease. The saturation level is caused by the injection of Poisson fluctuation at each step, which is considered an indicator of stopping. In Figure 4, the iteration number of ∼30 seems appropriate.

Figure 4.

Figure 4. Residuals of the two images before and after iterations for the entire region of Cas A vs. the number of iterations. The results of the RLsv method with and without statistical errors are plotted as a blue and an orange line, respectively.

Standard image High-resolution image

4.2. Assessment of Image Blurredness

We then designed a simplified method for evaluating a certain amount of confidence. The RLsv-deconvolved image should have a similar amount of fluctuation accompanying the observation image. The principle of the method is to propagate errors of the converged RLsv image into the next step. Our choice of using the last step for the error propagation is just to simplify the task. Here, each error in the observed image is considered statistically independent. Assuming only uncertainties on Hk , using the law of error propagation, the image uncertainty can be expressed as

Equation (5)

where ${W}^{{\prime} }$ is the image of the next iteration number of any estimated true image of W, and $\sqrt{{N}_{k}}$ is the statistical error of Nk .

We compared the off-axis RL image with the error of Equation (5) to an on-axis observation in Figure 5. Figures 5(a) and (b) are the off-axis southeastern images from Figures 3(a) and (b), respectively. Figure 5(c) shows a southeastern on-axis image of Obs. ID = 5196. The exposure time for the on-axis observation is ∼50 ks, resulting in a larger statistical error compared to the off-axis observation of ∼400 ks. The fan-shaped regions in Figures 5(a)–(c), along the filament, are chosen to create the radial profiles, which is the one-dimensional profile of the photons in each region extending from the central compact object (CCO) of Cas A toward the outer regions. These radial profiles were created using dmextract in CIAO. In Figure 5(d), the framed regions from Figures 5(a)–(c) are color-coded as blue, orange, and black, respectively. The error bars of the radial profiles in Figure 5(d) correspond to statistical errors, represented by blue and black, and the result obtained by applying Equation (5) to the 199th iteration of the RLsv image, is indicated by orange. From Figure 5(d), the profile of the off-axis RLsv image agreed with that of the on-axis image within the statistical errors. This method gives a guideline for a certain level of confidence associated with the RLsv method.

Figure 5.

Figure 5. Images and radial profiles of the southeastern filament of Cas A. (a) Off-axis image of Obs. ID = 4636, 4637, 4639, and 5319. (b) Result of the RLsv-image of (a). (c) On-axis image of Obs. ID = 5196. (d) Results of the radial profile in (a), (b), and (c), radially projected from the central compact object (CCO) using the fan-shaped regions in green. The horizontal axis represents the distance from the CCO.

Standard image High-resolution image

5. Discussion

5.1. Enhancement Technique and Possibilities

In this section, further enhancements to the RLsv method are discussed. The first is to reduce the loss of down-sampling PSFs. The positional dependence of the PSF does not contain high-frequency components, so the decimation of PSF sampling should work to some extent. For a small image such as a core plus jet structure in an active galactic nucleus, keeping a high sampling rate of the PSFs might be possible. However, for a largely extended source such as a supernova remnant or galaxy cluster, to minimize the sampling rate is critically important for practical use. The higher the decimation, the more emphasized the boundary of the segment. Taking Cas A as an example, the edges of specific segments clearly appear when the sampling interval is 35 × 35 pixels. To smooth out the edges, we propose that the PSFs' boundaries be randomly selected from nearby PSFs (see more details in the Appendix).

Second, this method can be developed by incorporating several regularization methods. We implemented an RLsv method incorporating the TV regularization expressed in Equation (3). Finally, the RLsv method is naturally applied to color images. By decomposing observed images into several colors (or energy bands) and generating PSFs for an appropriate energy in each band, an energy-dependent RLsv method can be realized.

We compare the RLsv method with and without the TV regularization. We use the same image as in Section 3.1. The PSF sampling is 35 × 35 pixels and the number of iterations is 200. The PSFs' boundaries are randomly selected following the Appendix. Figure 6 presents the enlarged eastern image after applying the RLsv method to the entire region. Figures 6(a) and (b) show the results of the RLsv method of Equation (2) and the regularization version of Equation (3), respectively. The TV regularization preserves the sharp structure and smoothes out statistical errors. In this way, regularization can be added to the RLsv method.

Figure 6.

Figure 6. Comparison of the results of the RLsv method without and with total variation regularization, shown in (a) and (b) respectively.

Standard image High-resolution image

We implement the RLsv method including these enhancements and adapt it to the Cas A observational data described in Section 3.1. Figure 2(a) is the observed image of Section 3.3 divided into three energies in RGB: 0.2–1.2 keV (red), 1.2–2.0 keV (green), and 2.0–7.0 keV (blue). Cas A is dominated by the thermal radiation in ≤4 keV and the nonthermal radiation in ≥4 keV. We applied the RLsv method with the TV regularization Equation (3) to each energy image of Figure 2(a) using the appropriate energy of the PSF (red is 0.92 keV, green is 1.56 keV, and blue is 3.8 keV) based on the official CIAO page. 4 The sampling interval of PSFs is 35 × 35 pixels. The PSF is randomly selected at the sampling boundaries according to the Appendix. The number of iterations is 30, according to Section 4.2. The result of the RLsv method is presented in Figure 2(b). The energy dependence in Figures 2(b-1) and (b-2) are clearly visible by this method.

5.2. Constraint on the Uncertainty

We propose two complementary ways to obtain a guideline on the stop condition of the RLsv method. One is to obtain a minimum of residuals by inserting statistical uncertainties into each update during the iteration process. This gives a rough estimate of the limit of the iterations. The other is to include the statistical uncertainties in the last step of the iteration. By combining the two methods, it is possible to derive uncertainties using the errors obtained by the latter method at the optimal iteration number estimated by the former. This is a quick and convenient way to derive the systematic uncertainties associated with the RLsv method.

The systematic uncertainty in the former method is a way of defining the optimal number of iterations. One easy way is to use the same level of residuals as shown in Section 4.1. It is intrinsically difficult to distinguish the signal of the celestial objects from the statistical noise. This difficulty needs to be overcome by comparing the deconvolved images without errors and the error-estimated images around the optimal iteration number recommended by the former method. This method is based on a compromise between the computational cost and the simplicity of use while keeping a reasonable statistical error.

6. Conclusion

We have improved the processing capability of RL deconvolution by incorporating the positional dependency of the Chandra PSF. The RLsv method is applied to the entire region of Cas A with an estimation of its limit and errors, which are based on the phenomenological method for evaluating a reasonable number of iterations and uncertainties. It shows that the features of shock waves and jets are sharper than those measured in the original image, with a certain amount of knowledge of the associated errors. The RLsv-deconvolved profile of the off-axis image at the southeastern filament became shaper and agreed with that of the on-axis observation within the statistical errors. This method is useful for a detailed diagnosis of other extended X-ray sources obtained by Chandra. The code used in this paper is available at doi:10.5281/zenodo.8020557.

Acknowledgments

We would like to thank the anonymous referee for helpful comments and feedback on this paper. This research has made use of data obtained from the Chandra Data Archive and the Chandra Source Catalog, and software provided by the Chandra X-ray Center (CXC) in the application packages CIAO. This work was supported by JSPS KAKENHI grant Nos. 20K20527, 22H01272, and 20H01941.

Appendix: Boundaries of the PSF

Decimating the sampling number of PSFs is an effective approach to minimize computational cost. However, this technique can introduce side-effects at the boundary of segments when switching between PSFs. The variation in shape between neighboring PSFs, caused by the sampling interval, leads to discontinuities in the deconvolution process. To mitigate this issue, a simple countermeasure is to randomly select adjacent PSFs at their boundaries, which helps to smooth out the discontinuities. The weights of the probabilities for selecting PSFs are illustrated in Figure 7. The presence and severity of artifacts depend on factors such as the dissimilarity in shape between neighboring PSFs, statistical characteristics of the observed images, and other relevant factors. Therefore, the presented technique serves as an example, and the problem is optimized by the range of pixels to be randomized.

Figure 7.

Figure 7. Illustration of the 9 × 9 pixels around the intersection of the PSF switchover, overlaid with the probability weights on selecting PSFs. The quadrants are named A, B, C, and D for illustrative purposes. The weights of the probabilities are either 1/3 or 2/3 at the two boundaries, while they are 1/9, 2/9, or 4/9 at the corners.

Standard image High-resolution image

The result of applying the selection rule in Figure 7 is shown in Figure 8. We compared the RLsv method for the observed data in the eastern region in Section 3.1 with a PSF sampling of 35 × 35 pixels and 200 iterations. Figure 8(a) shows the PSF images corresponding to Figures 8(b) and (c), where the white lines are used to clarify the border lines. Figures 8(b) and (c) are the RLsv-deconvolved images with and without using the randomization of the PSFs, respectively. To illustrate the differences, we present magnified images of Figures 8(b) and (c) as Figures 8(b-1) and (c-1), respectively. It appears that the discontinuity at the boundaries of the PSFs, indicated by the green arrows, is smeared out to some extent.

Figure 8.

Figure 8. Comparison of the RLsv method without and with PSF randomization at the boundaries. (a) PSF images corresponding to (b) and (c). (b) RLsv-image without correcting the PSF boundaries. (b-1) Enlarged images specified by the colored frames in (b). The arrows correspond to the boundaries of the PSFs. (c) Same as (b), but using the randomization of the PSFs.

Standard image High-resolution image

Footnotes

Please wait… references are loading.
10.3847/1538-4357/acd9b3