![]() |
CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers
|
Mapping function for 4-D NC/xHWx tensors.
#include <tensor.h>
Public Types | |
| using | Index = int32_t |
| Index type used for coordinates. More... | |
| using | LongIndex = int64_t |
| Long index type used for offsets. More... | |
| using | TensorCoord = Tensor4DCoord |
| Logical coordinate. More... | |
| using | Stride = Coord< kStrideRank > |
| Stride vector. More... | |
Public Member Functions | |
| CUTLASS_HOST_DEVICE | TensorNCxHWx (Stride const &stride=Stride(0)) |
| Constructor. More... | |
| CUTLASS_HOST_DEVICE LongIndex | operator() (TensorCoord const &coord) const |
| Returns the offset of a coordinate in linear memory. More... | |
| CUTLASS_HOST_DEVICE Stride | stride () const |
| Returns the stride of the layout. More... | |
| CUTLASS_HOST_DEVICE Stride & | stride () |
| Returns the stride of the layout. More... | |
| CUTLASS_HOST_DEVICE LongIndex | capacity (TensorCoord const &extent) const |
| Compute the number of contiguous elements needed to store a tensor with the given size. More... | |
Static Public Member Functions | |
| static CUTLASS_HOST_DEVICE TensorNCxHWx | packed (TensorCoord const &extent) |
| Helper returns a layout to a tightly packed tensor. More... | |
Static Public Attributes | |
| static int const | kInterleave = Interleave |
| Interleaving quantity. More... | |
| static int const | kRank = 4 |
| Logical rank of tensor. More... | |
| static int const | kStrideRank = 3 |
| Rank of stride vector. More... | |
| using cutlass::layout::TensorNCxHWx< Interleave >::Index = int32_t |
| using cutlass::layout::TensorNCxHWx< Interleave >::LongIndex = int64_t |
| using cutlass::layout::TensorNCxHWx< Interleave >::Stride = Coord<kStrideRank> |
| using cutlass::layout::TensorNCxHWx< Interleave >::TensorCoord = Tensor4DCoord |
|
inline |
|
inline |
|
inline |
|
inlinestatic |
|
inline |
|
inline |
|
static |
|
static |
|
static |
1.8.11