To: J3                                                     J3/19-110r2
From:    Van Snyder & Lorri Menard
Subject: Supporting rank genericity -- NOT VECTOR SUBSCRIPTS!
Date: 2019-February-12
References: 04-195, 18-247

19-110r1 adds a use case related to generic programming.

Introduction
============

1. In support of facilities in Fortran 2018
-------------------------------------------

MAXLOC(A) returns a rank-1 array whose extent is equal to the rank of A.
It cannot be used directly as a subscript for A.  One must assign it to
a variable of the appropriate rank, and use elements of that array as
subscripts.  For example

  integer :: S(3)
  real :: A(10,10,10)
  ...
  s = maxloc(a)
  print *, 'Maximum value in A is ', a(s(1),s(2),s(3))

LBOUND(A) and UBOUND(A) return rank-1 arrays whose extents are equal to
the rank of A. They cannot be used directly as bounds for an automatic
or allocatable variable.  For example

  real, intent(in) :: A(:,:,:)
  real :: X(ubound(a,1),ubound(a,2),ubound(a,3))

or

  real, allocatable :: X(:,:,:)
  ...
  allocate ( x(ubound(a,1),ubound(a,2),ubound(a,3)) )

or, if you need a halo around all dimensions of X

  allocate ( x(lbound(a,1)-1:ubound(a,1)+1, &
           &   lbound(a,2)-1:ubound(a,2)+1, &
           &   lbound(a,3)-1:ubound(a,3)+1) )

These are tedious if you know the rank, and impossible if we develop
facilities for rank-generic programming, except perhaps by using a MACRO
DO, which results in unreadable code.

History
=======

At meeting 217 on Monday 15 October /JOR announced without plenary
discussion that there would be no further discussion of 18-247.  Meeting
217 minutes state "Some features interesting; essentially vector
subscripts."  The proposal is NOT ESSENTIALLY VECTOR SUBSCRIPTS.  This
is EXPLICITLY STATED in 18-247, notwithstanding that the specifications
concerning the relationship of vector subscripts to component
initialization, association, finalization, contiguity, and I/O, would
also apply.  Read this paper all the way to the end, and read the
referenced paper 04-195 for more details.

Proposal
========

1. Array bounds use case
------------------------

Allow a single array B of rank one and extent R to specify the upper or
lower bounds of a rank R array A.  The extent of B shall be a constant,
except maybe for the A(..) case.  For example

  real, intent(in) :: A(:,:,:)
  real :: X(ubound(a))

or

  real, rank(A), allocatable :: X
  ...
  allocate ( x(ubound(a)) )

or (for the halo case)

  allocate ( x(lbound(a)-1:ubound(a)+1) )

The case of one bound being a scalar and the other not is obvious:

  allocate ( x(0:ubound(a)) )

is the same as

  allocate ( x([(0,i=1,rank(a))]:ubound(a)) )

Indeed, the default case (above) of lower bounds being all 1

  allocate ( x(ubound(a)) )

is the same as

  allocate ( x(1:ubound(a)) )

The halo case above becomes

  allocate ( x(lbound(a)-1:ubound(a)+1) )

If rank genericity is provided by the generic programming facility,
there is no other way to specify bounds for a generic-rank array, except
by something equivalent to a MACRO DO.  This results in unreadable code.

This would also be useful within a BLOCK within a SELECT RANK construct.

2. In support of generic programming
------------------------------------

This illustration presupposes generic programming is provided by
parameterized modules.

  module M ( R )
    parameters
      integer :: R ! a scalar integer used to specify array ranks
    end parameters
  contains
    subroutine P ( A )
      real, intent(inout), rank(r) :: A ! an assumed-shape array, for
      ! which a declaration using existing syntax could in principle be
      ! created by a MACRO DO statement or construct.  A subsequent
      ! reader might need to do some reverse engineering to understand
      ! the effect.
      integer :: S(r) ! S can be used as a subscript for A
      real :: X([shape(A),2]) ! rank(X) = 1 + rank(A)
      integer :: S2(r+1,2) ! S2 can be used as a subscript for X
  ....

If you don't understand the implications of this, PLEASE ASK FOR MORE
EXPLANATION!  There is nothing in DIMENSION(..) and SELECT RANK that
provides any support in this direction.

3. Subscripts
-------------

Allow a single array of rank one and extent R as the subscript of a
rank R array.  For example

  real :: A(10,10,10)
  ...
  print *, 'Maximum value in A is ', a(maxloc(a))

If S = [ 3, 4, 5 ], A(S) is a scalar, the same as A(3,4,5), not
A(3:3,4:4,5:5).

The obvious generalization here is to allow a single subscript S of rank
K+1 and extents (R,N1, ..., NK) as a subscript for a rank-R array A.
The elements of the rank-one sections in the first dimension of S are
used consecutively as subscripts for A, as in the rank one case,
resulting in a rank K object with shape [ N1, ..., NK ].  If K == 0, the
result is a scalar.

This provides a MORE GENERAL SCATTER/GATHER FACILITY THAN THE PRESENT
VECTOR SUBSCRIPT FACILITY.  This is NOT THE SAME AS USING THE ELEMENTS
OF THE RANK-ONE SECTIONS IN S AS VECTOR SUBSCRIPTS FOR A, which would
result in a rectangular section of shape [n,n] (in the case A is
of rank 2 and S is of shape [2,n]).

Example:

Suppose we have arrays A3 with dimensions (10,10,10) and S3 with
dimensions (3,2).  If we assume

                                            [ 3 6 ]
S3 = reshape( [3, 4, 5, 6, 7, 8], [3,2] ) = [ 4 7 ]
                                            [ 5 8 ]

then A3(S3) is a rank-1 extent-2 array that can appear in a variable-
definition context (except as an actual argument associated with a dummy
argument having INTENT(OUT) or INTENT(INOUT)); it specifies the same
array as [ A3(3,4,5), A3(6,7,8) ], which cannot appear in a
variable-definition context because it is not a variable.  This is
different from A3(S3(1,:),S3(2,:),S3(3,:)), which can appear in a
variable-definition context, but is an object with shape [2,2,2], not
[2].  The former is an arbitrary collection of elements of A3, while the
latter is a rectangular section of A3.

The degenerate case (same as above) is

  real :: A(10,10,10)
  integer :: S(3) = [ 3, 4, 5 ]

Then A(S) is the same as A(3,4,5), not A(3:3,4:4,5:5).  This illustrates
that a rank K+1 subscript with first extent R for a rank R array
produces a rank K result.  In this example, K = 0, so A(S) is a scalar,
not a rank-3 array with shape [1,1,1].  I.e., S is NOT A VECTOR
SUBSCRIPT.  But many of the same restrictions for vector subscripts
remain, e.g. not being an actual argument corresponding to an
INTENT(OUT) dummy argument.

This can probably be made to work with DIMENSION(..) arrays.  A run-time
check is probably required to make sure that in A(S), the first extent
of S is the same as the (dynamic) rank of A.  This might be especially
useful to create automatic variables with the same shape as
DIMENSION(..) arrays.

If rank genericity is provided by the generic programming facility,
there is no other way to specify subscripts for a generic-rank array,
except by something equivalent to a MACRO DO, which would appear
ubiquitously, in every reference, not only in declarations.  This
results in unreadable code.

Let's not do kludges and half measures, which is how we got nonpointer
optional arguments corresponding to disassociated pointers being
absent.  The original solution proposed for the combinatorial-explosion
problem was a special case of the conditional expression proposal, with
an empty ELSE part meaning "if the logical expression is true, the
argument is the consequent (not its value), else the argument is
absent."

Use Case
========

Compute a list of sets of subscripts where an array has nonzero values.

Here is an elegant rank-generic parallel solution that also depends upon
generalizing the CRITICAL construct to apply to the relationship between
iterations of a DO CONCURRENT construct (or provide a MONITOR procedure
for the same purpose).

  integer :: Subs(rank(a),size(a))
  integer :: Num_NZ
  num_nz = 0
  ! The <index-variable> I below is an array having the same extent as
  ! the rank of A.  It takes on all values of sets of subscripts of A.
  do concurrent ( integer :: I(rank(a)) = lbound(a) : ubound(a) ) &
    & shared ( num_nz )
    if ( a(i) /= 0 ) then
      critical
        num_nz = num_nz + 1 ! Presently prohibited
        subs(:,num_nz) = i
      end critical
    end if
  end do

  print '(a/1p5g15.8)', 'Nonzero elements of A:', a(subs(:,:num_nz))

or

  write ( *, '(a)' ) 'Nonzero elements of A:'
  do i = 1, num_nz
    write ( *, '(a,i0,*(",",i0:))', advance='no' ) "(", subs(:,i)
    write ( *, '(a, 1pg15.8)' ) ") ", a(subs(:,i))
  end do

Here is an equivalent rank-generic but not parallel solution using an
extension of non-concurrent DO.  The extension allows the <do-variable>
to be an array with the same extent as the loop bounds.

  ! The <do-variable> I below is an array having the same extent as the
  ! rank of A.  It takes on all values of sets of subscripts of A, in
  ! array-element order.
  do ( integer :: I(rank(a)) = lbound(a), ubound(a) )
    if ( a(i) /= 0 ) then
      num_nz = num_nz + 1
      subs(:,num_nz) = i
    end if
  end do

If these could be generated by a MACRO DO, the result would be truly
unreadable, and fragile.  The DO nesting depth would depend upon the
rank of the array.

JoR Review of Proposal
======================

There are several aspects to this proposal with different issues.
The proposal is a collection of other proposals, being pulled
together in the name of "generics", with an eye towards supporting
variables of unknown rank.

JoR agrees that it could be useful to use an integer array to hold the
bounds to be used for defining an allocatable or automatic array of
unknown rank.

The majority of this paper, however, is geared more towards other
features, not specific to dealing with arrays of unknown rank.

- The examples and text from 04-195 are exactly a "generalization of
vector-valued subscript" as given by the title of that paper.
The purpose of having a rank-N valued subscript is to provide a
general scatter/gather facility, as mentioned above and in the
referenced paper.

- The use-case of a DO variable being an array has a far-reaching
impact on implementations, program performance, readability.

- It's not clear that the CRITICAL use case is related to variables of
unknown rank, nor what MONITOR is.

- Rank-N subscript arrays in a variable-definition context will impact
many places in the compiler, in ways that are unlikely to exceed their
effectiveness

Recommendation from JoR
=======================

A general facility of rank-N subscripts should not be considered
as part of the next standard.

Features from this that specifically aid in defining or allocating
variables of unknown rank might be useful.  This will depend on
the definition "generic programming".