This subroutine uses Cholesky factorization to factor a positive definite symmetric band matrix A, stored in upper- or lower-band-packed storage mode, into one of the following forms:
where, in the formulas above:
To solve the system of equations with multiple right-hand sides, follow the call to this subroutine with one of more calls to PDPBTRS. The output from this factorization subroutine should be used only as input to PDPBTRS.
If n = 0, no computation is performed and the subroutine
returns after doing some parameter checking. See references [2], [23], [39], and [40].
A, af, work | Subroutine |
Long-precision real | PDPBTRF |
Fortran | CALL PDPBTRF (uplo, n, k, a, ja, desc_a, af, laf, work, lwork, info) |
C and C++ | pdpbtrf (uplo, n, k, a, ja, desc_a, af, laf, work, lwork, info); |
If uplo = 'U', the upper triangular part is referenced.
If uplo = 'L', the lower triangular part is referenced.
Scope: global
Specified as: a single character; uplo = 'U' or 'L'.
Scope: global
Specified as: a fullword integer; 0 <= n <= (NB_A)p-mod(ja-1,NB_A).
Scope: global
Specified as: a fullword integer, where:
These limits for k are extensions of the ScaLAPACK standard.
Scope: local
Specified as: an LLD_A by (at least) LOCp(ja+n-1) array, containing numbers of the data type indicated in Table 69. Details about the block-cyclic data distribution of global matrix A are stored in desc_a.
On output, array A is overwritten; that is, original input is not preserved.
Scope: global
Specified as: a fullword integer; 1 <= ja <= N_A and ja+n-1 <= N_A.
desc_a | Name | Description | Limits | Scope |
---|---|---|---|---|
1 | DTYPE_A | Descriptor Type | DTYPE_A = 501 for 1 × p or
p × 1
where p is the number of processes in a process grid. | Global |
2 | CTXT_A | BLACS context | Valid value, as returned by BLACS_GRIDINIT or BLACS_GRIDMAP | Global |
3 | N_A | Number of columns in the global matrix |
If n = 0: N_A >= 0 Otherwise: N_A >= 1 | Global |
4 | NB_A | Column block size | NB_A >= 1 and 0 <= n <= (NB_A)p-mod(ja-1,NB_A) | Global |
5 | CSRC_A | The process column over which the first column of the global matrix is distributed | 0 <= CSRC_A < p | Global |
6 | LLD_A | Leading dimension | LLD_A >= k+1 | Local |
7 | -- | Reserved | -- | -- |
Specified as: an array of (at least) length 7, containing fullword
integers.
desc_a | Name | Description | Limits | Scope |
---|---|---|---|---|
1 | DTYPE_A | Descriptor type | DTYPE_A = 1 for 1 × p
where p is the number of processes in a process grid. | Global |
2 | CTXT_A | BLACS context | Valid value, as returned by BLACS_GRIDINIT or BLACS_GRIDMAP | Global |
3 | M_A | Number of rows in the global matrix | M_A > k | Global |
4 | N_A | Number of columns in the global matrix |
If n = 0: N_A >= 0 Otherwise: N_A >= 1 | Global |
5 | MB_A | Row block size | MB_A >= 1 | Global |
6 | NB_A | Column block size | NB_A >= 1 and 0 <= n <= (NB_A)p-mod(ja-1,NB_A) | Global |
7 | RSRC_A | The process row over which the first row of the global matrix is distributed | RSRC_A=0 | Global |
8 | CSRC_A | The process column over which the first column of the global matrix is distributed | 0 <= CSRC_A < p | Global |
9 | LLD_A | The leading dimension of the local array | LLD_A >= k+1 | Local |
Specified as: an array of (at least) length 9, containing fullword integers.
Scope: local
Specified as: for migration purposes, you should specify a one-dimensional, long-precision array of (at least) length LAF.
The laf argument must be specified; however, this subroutine currently ignores its value. For migration purposes, you should specify laf using the formula below.
Scope: local
Specified as: a fullword integer, laf >= (NB_A+2k)(k).
If lwork = 0, work is ignored.
If lwork <> 0, work is the work area used by this subroutine, where:
Scope: local
Specified as: an area of storage containing numbers of data type indicated in Table 69.
Scope:
Specified as: a fullword integer; where:
lwork >= k2
Scope: local
Returned as: an LLD_A by (at least) LOCp(ja+n-1) array, containing numbers of the data type indicated in Table 69.
On output, array A is overwritten; that is, original input is not preserved.
If lwork <> 0 or lwork <> -1, the size of work is (at least) of length lwork.
If lwork = -1, the size of work is (at least) of length 1.
Scope: local
Returned as: an area of storage, containing numbers of the data type indicated in Table 69, where:
Except for work1, the contents of work are overwritten on return.
If info = 0, global submatrix A is positive definite and the factorization completed normally, or the work area query completed successfully.
If info > 0, the leading minor of order i of the global submatrix A is not positive definite. info is set equal to i, where the first leading minor was encountered at Aja+i-1, ja+i-1. The results contained in matrix A are not defined.
Scope: global
Returned as: a fullword integer; info >= 0.
where p is the number of processes. For details, see references [2], [39], and [40]. Also, it is suggested that you specify uplo = 'L'.
The data specified for input arguments uplo, n, and k must be the same for both PDPBTRF and PDPBTRS.
The matrix A and af input to PDPBTRS must be the same as the corresponding output arguments for PDPBTRF; and thus, the scalar data specified for ja, desc_a, and laf must also be the same.
DTYPE_A | Process Grid |
---|---|
501 | p × 1 or 1 × p |
1 | 1 × p |
Matrix A must be distributed over a one-dimensional process grid, using block-cyclic data distribution. For more information on using block-cyclic data distribution, see Specifying Block-Cyclically-Distributed Matrices for the Banded Linear Algebraic Equations.
Matrix A is not positive definite. For details, see the description of the info argument.
lwork= 0 and unable to allocate workspace
Each of the following global input arguments are checked to determine whether its value differs from the value specified on process P00:
Also:
This example shows a factorization of the positive definite symmetric band matrix A of order 9 with a half bandwidth of 7:
* * | 1.0 1.0 1.0 1.0 1.0 1.0 1.0 1.0 0.0 | | 1.0 2.0 2.0 2.0 2.0 2.0 2.0 2.0 1.0 | | 1.0 2.0 3.0 3.0 3.0 3.0 3.0 3.0 2.0 | | 1.0 2.0 3.0 4.0 4.0 4.0 4.0 4.0 3.0 | | 1.0 2.0 3.0 4.0 5.0 5.0 5.0 5.0 4.0 | | 1.0 2.0 3.0 4.0 5.0 6.0 6.0 6.0 5.0 | | 1.0 2.0 3.0 4.0 5.0 6.0 7.0 7.0 6.0 | | 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 7.0 | | 0.0 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 | * *
Matrix A is stored in lower-band-packed storage mode:
* * | 1.0 2.0 3.0 4.0 5.0 6.0 7.0 8.0 8.0 | | 1.0 2.0 3.0 4.0 5.0 6.0 7.0 7.0 . | | 1.0 2.0 3.0 4.0 5.0 6.0 6.0 . . | | 1.0 2.0 3.0 4.0 5.0 5.0 . . . | | 1.0 2.0 3.0 4.0 4.0 . . . . | | 1.0 2.0 3.0 3.0 . . . . . | | 1.0 2.0 2.0 . . . . . . | | 1.0 1.0 . . . . . . . | * *
where "." means you do not have to store a value in that position in the local array. However, these storage positions are required and are overwritten during the computation.
Matrix A is distributed over a 1 × 3 process grid using block-cyclic distribution.
Notes:
ORDER = 'R' NPROW = 1 NPCOL = 3 CALL BLACS_GET (0, 0, ICONTXT) CALL BLACS_GRIDINIT(ICONTXT, ORDER, NPROW, NPCOL) CALL BLACS_GRIDINFO(ICONTXT, NPROW, NPCOL, MYROW, MYCOL) UPLO N K A JA DESC_A AF LAF WORK LWORK INFO | | | | | | | | | | | CALL PDPBTRF( 'L' , 9 , 7 , A , 1 , DESC_A , AF , 119 , WORK , 0 , INFO )
| Desc_A |
---|---|
DTYPE_ | 501 |
CTXT_ | icontxt(CGBTOO9) |
N_ | 9 |
NB_ | 3 |
CSRC_ | 0 |
LLD_A | 8 |
Reserved | -- |
Notes: |
Global matrix A stored in lower-band-packed storage mode with block size of 3:
B,D 0 1 2 * * | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 7.0 8.0 8.0 | | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 7.0 7.0 . | | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 6.0 . . | | 1.0 2.0 3.0 | 4.0 5.0 5.0 | . . . | 0 | 1.0 2.0 3.0 | 4.0 4.0 . | . . . | | 1.0 2.0 3.0 | 3.0 . . | . . . | | 1.0 2.0 2.0 | . . . | . . . | | 1.0 1.0 . | . . . | . . . | * *
The following is the 1 × 3 process grid:
B,D | 0 | 1 | 2 -----| ------- | ------- |------- 0 | P00 | P01 | P02
Local array A with block size of 3:
p,q | 0 | 1 | 2 -----|-----------------|------------------|----------------- | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 7.0 8.0 8.0 | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 7.0 7.0 . | 1.0 2.0 3.0 | 4.0 5.0 6.0 | 6.0 . . | 1.0 2.0 3.0 | 4.0 5.0 5.0 | . . . 0 | 1.0 2.0 3.0 | 4.0 4.0 . | . . . | 1.0 2.0 3.0 | 3.0 . . | . . . | 1.0 2.0 2.0 | . . . | . . . | 1.0 1.0 . | . . . | . . .
Output:
Global matrix A is returned in lower-band-packed storage mode with block size of 3:
B,D 0 1 2 * * | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 1.0 | | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 . | | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 . . | | 1.0 1.0 1.0 | 1.0 1.0 1.0 | . . . | 0 | 1.0 1.0 1.0 | 1.0 1.0 . | . . . | | 1.0 1.0 1.0 | 1.0 . . | . . . | | 1.0 1.0 1.0 | . . . | . . . | | 1.0 1.0 . | . . . | . . . | * *
The following is the 1 × 3 process grid:
B,D | 0 | 1 | 2 -----| ------- | ------- |------- 0 | P00 | P01 | P02
Local array A with block size of 3:
p,q | 0 | 1 | 2 -----|-----------------|------------------|----------------- | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 1.0 . | 1.0 1.0 1.0 | 1.0 1.0 1.0 | 1.0 . . | 1.0 1.0 1.0 | 1.0 1.0 1.0 | . . . 0 | 1.0 1.0 1.0 | 1.0 1.0 . | . . . | 1.0 1.0 1.0 | 1.0 . . | . . . | 1.0 1.0 1.0 | . . . | . . . | 1.0 1.0 . | . . . | . . .
The value of info is 0 on all processes.