PyTorch Tensor 101

A No-Nonsense Guide to PyTorch Tensors

August 1, 2022


PyTorch is an open source, Python-based scientific computing package for building neural networks. It is a dynamic graph-based framework that lets you define your neural network in a way that is easy to understand and debug. Today, PyTorch is one of the most widely used deep learning frameworks, popular with both researchers and engineers.

PyTorch supports GPU acceleration behind the scenes (making your code run faster), which NumPy does not offer. PyTorch also provides Autograd for automatic differentiation: it records the operations performed on tensors and computes the gradients for you, which is what backpropagation relies on.

PyTorch Installation

Before you install PyTorch, you need the following dependencies: a package manager (e.g. pip or conda), Python, and NumPy. For more information, please refer to the PyTorch documentation.

I am on a Mac and use conda as my package manager, so I run the following command:


To verify that your installation works, import the package:

import torch

What is a Tensor?

  • Assume we have 3 bedrooms, 1 carpark and 2 bathrooms. We can represent this data numerically as the vector [3, 1, 2], describing the bedrooms, carpark and bathrooms.

  • Tensors are the standard way of representing data, such as text, images, and audio, in PyTorch. Their job is to represent data numerically.

Is Tensor all you need?

  • Python has several data structures for holding data, including the Python list and the NumPy array. Many list and NumPy array operations have close counterparts on PyTorch tensors.

  • Let us review the basics of these data structures (lists and NumPy arrays) before we start using PyTorch tensors.

From Python lists to NumPy Arrays

  • Python has no built-in array type designed for numerical computing, but Python lists can be used instead.

  • Using our previous example, we can create a Python list as shown below.

a_list = [3, 1, 2] # a list is the Python equivalent of an array

print(a_list) # print the list
print(type(a_list)) # print the type
print(a_list[0]) # index the first element
[3, 1, 2]
<class 'list'>
3

However, Python lists have the following limitations: they use a lot of memory and are slow for numerical work.

  • NumPy addresses these problems with lists:

    • Size - NumPy data structures take up less space.

    • Performance - NumPy arrays are typically faster than lists for numerical operations.

    • Functionality - SciPy and NumPy have optimized functions, such as linear algebra operations, built in.

import numpy as np
a_numpy = np.array([1,3,4]) # creating a numpy array
array([1, 3, 4])
type(a_numpy) # nd arrays
a_numpy[0] # we can subset similar to Python list
a_numpy.shape # shape of the nd array
a_numpy.dtype # dtype of the nd array
a_numpy.size # size of the nd array

Performance comparison between Python lists and Numpy Arrays

import numpy as np
import time

size_of_vec = 1000

def pure_python_version():
    t1 = time.time()
    X = range(size_of_vec)
    Y = range(size_of_vec)
    Z = [X[i] + Y[i] for i in range(len(X)) ]
    return time.time() - t1

def numpy_version():
    t1 = time.time()
    X = np.arange(size_of_vec)
    Y = np.arange(size_of_vec)
    Z = X + Y
    return time.time() - t1

t1 = pure_python_version()
t2 = numpy_version()
print(t1, t2)
print("Numpy is in this example " + str(t1/t2) + " faster!")
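0.00019288063049316406 0.0005578994750976562
Numpy is in this example 0.3457264957264957 faster!

Note that the ratio printed here is below 1, so in this particular run NumPy was actually slower than pure Python. With only 1,000 elements, the fixed overhead of creating the arrays dominates and a single time.time() measurement is noisy; NumPy's advantage typically shows up with much larger arrays.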
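To get a steadier comparison, the same idea can be repeated with the standard library's timeit module and a larger vector. This is only a rough sketch: the vector size and the number of repetitions below are my own assumptions, and the timings will differ from machine to machine, so no expected output is shown.

```python
import timeit

import numpy as np

size_of_vec = 200_000  # assumed size, much larger than the 1000 used above


def pure_python_version():
    X = range(size_of_vec)
    Y = range(size_of_vec)
    return [X[i] + Y[i] for i in range(len(X))]


def numpy_version():
    X = np.arange(size_of_vec)
    Y = np.arange(size_of_vec)
    return X + Y


# timeit calls each function several times and returns the total elapsed time
t_python = timeit.timeit(pure_python_version, number=5)
t_numpy = timeit.timeit(numpy_version, number=5)

print(f"pure Python: {t_python:.4f}s, NumPy: {t_numpy:.4f}s")
print(f"ratio (Python / NumPy): {t_python / t_numpy:.1f}")
```

A ratio above 1 means NumPy was faster in that run.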

From Numpy Arrays to Torch Tensor

  • Tensors are like arrays: both are data structures used to store data. Tensors and NumPy arrays share common attributes such as shape and size.

Tensors are a generalization of vectors and matrices to an arbitrary number of dimensions.

  • Just as NumPy provides support that is not available in Python lists, tensors provide support that is not available in NumPy arrays, such as:

    • GPU acceleration, which is a great advantage for deep learning,

    • distributing operations across multiple devices or machines, and

    • keeping track of the graph of computations that created them (useful for backpropagation).
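As a small sketch of the first and last points, the snippet below picks whichever device is available and lets PyTorch compute a gradient. The device-selection logic and the toy function y = sum(x^2) are my own illustrative choices; the code falls back to the CPU when no GPU is present.

```python
import torch

# Pick the best available device; fall back to the CPU if there is no GPU.
mps_backend = getattr(torch.backends, "mps", None)  # not present in older versions
if torch.cuda.is_available():
    device = torch.device("cuda")
elif mps_backend is not None and mps_backend.is_available():
    device = torch.device("mps")
else:
    device = torch.device("cpu")

# requires_grad=True asks PyTorch to record the operations applied to x
x = torch.tensor([1.0, 2.0, 3.0], device=device, requires_grad=True)
y = (x ** 2).sum()  # y = x1^2 + x2^2 + x3^2
y.backward()        # walk the recorded graph backwards to get dy/dx

print(device)
print(x.grad)       # the gradient of y with respect to x is 2 * x
```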

Let Us Learn Tensors

Various operations are available on tensors. In the next sections, we will discuss the following operations:

  • Creating tensors.

  • Operations with tensors.

  • Indexing, slicing, and joining with tensors.

  • Computing gradients with tensors.

  • Using CUDA/MPS tensors with GPUs.

Creating tensors

  • PyTorch allows us to create tensors in many different ways using the torch package. We will discuss some of these ways.

1: Creating a tensor from data

torch.tensor is a general tensor constructor that infers the data type automatically.

import torch

a_random = torch.tensor((3,4)) # create a 1-D tensor holding the values 3 and 4 (not random)
tensor([3, 4])
print(a_random.shape) # print the shape of the tensor
print(a_random.size()) # print the size of the tensor (same as the shape)
print(type(a_random)) # print the Python type of the tensor
print(a_random.type()) # print the tensor type, e.g. torch.LongTensor
<class 'torch.Tensor'>

Note: .shape is an alias for .size(), and was added to match NumPy more closely!

  • Instead of letting torch.tensor determine the data type automatically, you can specify it explicitly with the dtype parameter.
import torch

a_random = torch.tensor((3,4), dtype=torch.float) # create a float tensor holding the values 3 and 4
tensor([3., 4.])
print(a_random.shape) # print the shape of the tensor
print(a_random.size()) # print the size of the tensor (same as the shape)
print(type(a_random)) # print the Python type of the tensor
<class 'torch.Tensor'>
  • You can also change the type of an existing tensor by using its conversion methods, such as short() and float().
a_torch = torch.tensor([1, 2, 3])

print(a_torch.type()) # Tensor type

We can convert from LongTensor to other types:

a_short =  a_torch.short() # Convert to short,  
a_float =  a_torch.float() # Convert to float()

print(a_short.type()) # Tensor type
print(a_float.type()) # Tensor type

Note: A variant of the torch.tensor constructor is the torch.FloatTensor constructor. When used, the resulting tensor type is FloatTensor. In fact, torch.Tensor is an alias for the default tensor type, torch.FloatTensor.

  • The following two examples are equivalent:
a_random = torch.Tensor((3,4)) # create a float tensor holding the values 3 and 4
b_random = torch.FloatTensor((3,4)) # same result using the FloatTensor constructor


I recommend sticking to torch.tensor; if you would like to change the type, you can pass the dtype argument or convert the tensor afterwards.

Torch defines 10 tensor types with CPU and GPU variants (see the PyTorch data types documentation):

  • The most common type (and generally the default) is torch.float32 or torch.float. This is referred to as “32-bit floating point”.

  • But there’s also 16-bit floating point (torch.float16 or torch.half) and 64-bit floating point (torch.float64 or torch.double).

  • The reason for all of these is to do with precision in computing. Precision is the amount of detail used to describe a number.

  • The higher the precision value (8, 16, 32), the more detail and hence data used to express a number.

  • This matters in deep learning and numerical computing because you are performing so many operations: the more detail you have to calculate on, the more compute you have to use.

So, lower precision datatypes are generally faster to compute on but can sacrifice some performance on evaluation metrics like accuracy (faster to compute but less accurate).
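To make the precision trade-off concrete, here is a minimal sketch that stores the same value, 1/3, at three floating point precisions. The choice of 1/3 is just an illustrative assumption; any number that cannot be represented exactly would do.

```python
import torch

value = 1 / 3  # cannot be stored exactly in binary floating point

for dtype in (torch.float16, torch.float32, torch.float64):
    t = torch.tensor([value], dtype=dtype)
    # element_size() is the number of bytes used to store one element
    print(dtype, t.element_size(), "bytes per element, stored as", t.item())
```

The lower precision types use fewer bytes per element, and the stored value drifts further from 1/3.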

2: Creating Tensors from Random Numbers

Similar to NumPy, we can create a tensor filled with random numbers.

a_random_torch = torch.randn(2, 3) # random numbers from a standard normal distribution (mean 0, std 1)
# a_numpy_rand = np.random.randn(2,3) #numpy random normal distribution

# print(a_numpy_rand)
tensor([[ 0.0461,  0.4024, -1.0115],
        [ 0.2167, -0.6123,  0.5036]])
a_random_torch = torch.rand(2, 3) # uniform random numbers in the interval [0, 1)
# a_numpy_rand = np.random.rand(2,3) 

# print(a_numpy_rand)
tensor([[0.7749, 0.8208, 0.2793],
        [0.6817, 0.2837, 0.6567]])
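The two functions are easy to mix up, so here is a quick sanity check of what each one samples from. This is only a sketch: the seed and the sample size are arbitrary choices of mine.

```python
import torch

torch.manual_seed(0)  # fix the seed so repeated runs give the same numbers

uniform = torch.rand(10_000)   # uniform on [0, 1)
normal = torch.randn(10_000)   # standard normal: mean 0, std 1

print(uniform.min().item(), uniform.max().item())  # both inside [0, 1)
print(normal.mean().item(), normal.std().item())   # close to 0 and 1
```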

3: Creating a filled tensor

a_same_scalar = torch.zeros(3,3)
tensor([[0., 0., 0.],
        [0., 0., 0.],
        [0., 0., 0.]])
torch.Size([3, 3])
torch.ones(3, 3) # torch.ones(size=(3, 3)) 
tensor([[1., 1., 1.],
        [1., 1., 1.],
        [1., 1., 1.]])

Any PyTorch method with a trailing underscore (_) is an in-place operation:

a_zero = torch.zeros(2, 3)
print(a_zero) # a_zero starts out filled with zeros
print(a_zero.fill_(5)) # in-place operation, returns the modified tensor
print(a_zero)  # a_zero is now filled with 5
tensor([[0., 0., 0.],
        [0., 0., 0.]])
tensor([[5., 5., 5.],
        [5., 5., 5.]])
tensor([[5., 5., 5.],
        [5., 5., 5.]])

4: Creating and initializing a tensor from lists

a_list = torch.tensor([1, 2, 3])
tensor([1, 2, 3])

5: Creating and initializing a tensor from numpy arrays

  • We use torch.from_numpy to create a tensor from a numpy array.
import numpy as np
numpy_array = np.random.rand(2, 3) 

torch_tensor = torch.from_numpy(numpy_array) # tensor from numpy array
tensor([[0.3487, 0.9072, 0.8480],
        [0.7245, 0.6970, 0.4976]], dtype=torch.float64)
  • The data type after creating a tensor from a NumPy array is DoubleTensor instead of the default FloatTensor. This corresponds to the data type of the NumPy random matrix, float64.

You can always convert a PyTorch tensor back to a NumPy array using the tensor's numpy() method, for example torch_tensor.numpy().

array([[0.3487288 , 0.90720583, 0.84795941],
       [0.72447844, 0.69699952, 0.49759155]])
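One detail worth knowing about this round trip: for CPU tensors, torch.from_numpy and Tensor.numpy() share memory with the original array rather than copying it. A minimal sketch (the array of zeros and the value 42 are just placeholders I picked):

```python
import numpy as np
import torch

numpy_array = np.zeros((2, 3))
torch_tensor = torch.from_numpy(numpy_array)  # shares memory with numpy_array
back_to_numpy = torch_tensor.numpy()          # shares the same memory again

numpy_array[0, 0] = 42.0   # modify the original array in place

print(torch_tensor[0, 0])   # the tensor sees the change
print(back_to_numpy[0, 0])  # and so does the converted array
print(torch_tensor.dtype)   # torch.float64, inherited from the NumPy array
```

If you need an independent copy, torch.tensor(numpy_array) copies the data instead.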

6: Creating a range and tensors like

# Use torch.arange(), torch.range() is deprecated 
zero_to_ten = torch.arange(0, 10) 
tensor([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])

Creating a tensor with the same shape and type as another tensor.

# Can also create a tensor of zeros similar to another tensor
ten_zeros = torch.zeros_like(input=zero_to_ten) # will have same shape
tensor([0, 0, 0, 0, 0, 0, 0, 0, 0, 0])

Creating Named Tensors

  • Named Tensors allow users to give explicit names to tensor dimensions.

  • In most cases, operations that take dimension parameters will accept dimension names, avoiding the need to track dimensions by position.

torch.zeros(2, 3, names=('N', 'C'))
UserWarning: Named tensors and all their associated APIs are an experimental feature and subject to change. Please do not use them for anything important until they are released as stable.
tensor([[0., 0., 0.],
        [0., 0., 0.]], names=('N', 'C'))
  • We can use names to access tensor dimensions.
imgs = torch.randn(1, 2, 2, 3, names=('N', 'C', 'H', 'W'))
imgs.names # names of the tensor dimensions
('N', 'C', 'H', 'W')

Tensor properties

A tensor has many properties, including the number of dimensions, the size, and the type.

Tensor Dimensions

We can find the number of dimensions of a tensor using the ndim attribute:

# Scalar
scalar = torch.tensor(7)

MATRIX = torch.tensor([[1,2,3,4],


You can tell the number of dimensions a tensor in PyTorch has by the number of square brackets on the outside ([) and you only need to count one side of the brackets.
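The bracket-counting rule can be checked directly with ndim. The tensors below are small made-up examples of mine, not the ones from the original notebook:

```python
import torch

scalar = torch.tensor(7)                     # no brackets
vector = torch.tensor([7, 7])                # one pair of brackets
matrix = torch.tensor([[7, 8], [9, 10]])     # two pairs of brackets
tensor3d = torch.tensor([[[1, 2], [3, 4]]])  # three pairs of brackets

examples = [("scalar", scalar), ("vector", vector),
            ("matrix", matrix), ("tensor3d", tensor3d)]
for name, t in examples:
    print(name, "ndim:", t.ndim, "shape:", tuple(t.shape))
```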

In practice, you’ll often see scalars and vectors denoted as lowercase letters such as y or a, and matrices and tensors denoted as uppercase letters such as X or W.

Manipulating tensors (tensor operations)

  • In deep learning, data (images, text, video, audio, protein structures, etc) gets represented as tensors.

  • A model learns by investigating those tensors and performing a series of operations (could be 1,000,000s+) on tensors to create a representation of the patterns in the input data.

  • After you have created your tensors, you can operate on them as you would with traditional programming language types, using operators like +, -, *, /.

Indexing tensors

Indexing and subsetting a tensor is similar to indexing a list.

some_list = list(range(6))
torch_list = torch.tensor(some_list)
tensor([0, 1, 2, 3, 4, 5])
print(torch_list[0]) # first element of the tensor
print(torch_list[1]) # second element of the tensor
torch_list[1:4] # subsetting a tensor
tensor([1, 2, 3])
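The same indexing syntax extends to more than one dimension, and tensors can be joined with torch.cat. Here is a short sketch on a made-up 3 x 4 tensor (the name grid is mine):

```python
import torch

grid = torch.arange(12).reshape(3, 4)  # 3 rows, 4 columns, values 0..11

print(grid[0])       # first row
print(grid[:, 1])    # second column
print(grid[1:, :2])  # rows 1-2, columns 0-1

# torch.cat joins tensors along an existing dimension
stacked_rows = torch.cat([grid, grid], dim=0)  # shape (6, 4)
stacked_cols = torch.cat([grid, grid], dim=1)  # shape (3, 8)
print(stacked_rows.shape, stacked_cols.shape)
```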

Transposing Tensors

Transposing 2D tensors is a simple operation using the t() method:

points = torch.tensor([[4.0, 1.0], [5.0, 3.0], [2.0, 1.0]])
tensor([[4., 1.],
        [5., 3.],
        [2., 1.]])
points_t = points.t()
tensor([[4., 5., 2.],
        [1., 3., 1.]])

You can also transpose 3D and higher tensors using the transpose method by specifying the two dimensions along which transposing (flipping shape and stride) should occur:

some_t = torch.ones(3, 4, 5)
transpose_t = some_t.transpose(0, 2)
torch.Size([3, 4, 5])
torch.Size([5, 4, 3])

Tensor View Operation

A tensor view operation returns a new tensor that shares the same data as the original tensor but has a different shape.

x = torch.randn(2, 2)
tensor([[-0.4790,  0.8539],
        [-0.2285,  0.3081]])
torch.Size([2, 2])
y = x.view(4)
tensor([-0.4790,  0.8539, -0.2285,  0.3081])

Using -1 in the shape argument will automatically infer the correct size of the dimension.

z = x.view(-1, 2)  # the size -1 is inferred from other dimensions

tensor([[-0.4790,  0.8539],
        [-0.2285,  0.3081]])
torch.Size([2, 2])

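view() does not change the tensor layout in memory; it only reinterprets the shape, so it requires the existing layout to be compatible with the new shape. transpose() does not copy data either: it swaps the shape and stride, so the result is usually no longer contiguous, and calling view() on it can fail unless you call contiguous() first.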
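Here is a small sketch of how view and a transposed tensor interact. The 2 x 3 example tensor is my own choice; the calls themselves (stride, data_ptr, is_contiguous, contiguous) are standard tensor methods.

```python
import torch

x = torch.arange(6).reshape(2, 3)
x_t = x.t()  # transposed tensor with shape (3, 2)

print(x.stride(), x_t.stride())        # the strides are swapped
print(x_t.data_ptr() == x.data_ptr())  # same underlying memory, nothing copied
print(x_t.is_contiguous())             # no longer laid out row by row

try:
    x_t.view(6)  # view needs a compatible layout, so this raises
except RuntimeError as err:
    print("view failed:", err)

flat = x_t.contiguous().view(6)  # copy into a contiguous layout first
print(flat)
```

x_t.reshape(6) gives the same result and copies only when it has to.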

Tensor Mathematical Basic Operations

Tensor addition is achieved with the + operator (or, equivalently, torch.add), as shown in the following example:

# Create a tensor of values and add a number to it
tensor = torch.tensor([1, 2, 3])
tensor + 10
tensor([11, 12, 13])
# Multiply it by 10
tensor * 10
tensor([10, 20, 30])
# Subtract and reassign
tensor = tensor - 10
tensor([-9, -8, -7])

PyTorch also has a bunch of built-in functions like torch.mul() (short for multiplication; torch.multiply() is an alias) and torch.add() to perform basic operations.

# Can also use torch functions
tensor = torch.tensor([1, 2, 3])
torch.multiply(tensor, 10)  # multiply by 10
tensor([10, 20, 30])
tensor = torch.tensor([1, 2, 3])

torch.add(tensor, 20) # add by 20
tensor([21, 22, 23])
torch.div(tensor, 20, rounding_mode='trunc') # divide by 20, with truncation as a rounding_mode
tensor([0, 0, 0])
torch.div(tensor, 20, rounding_mode='floor') # divide by 20, with floor as a rounding_mode
tensor([0, 0, 0])
torch.sum(tensor) # sum tensor entries  [1, 2, 3]

Matrix multiplication is all you need

  • In deep learning algorithms (like neural networks), one of the most common operations is matrix multiplication.

  • PyTorch implements matrix multiplication functionality in the torch.matmul() function.

  • The main two rules for matrix multiplication to remember are:

The inner dimensions must match:

  • (3, 2) @ (3, 2) won’t work
  • (2, 3) @ (3, 2) will work
  • (3, 2) @ (2, 3) will work

The resulting matrix has the shape of the outer dimensions:

  • (2, 3) @ (3, 2) -> (2, 2)
  • (3, 2) @ (2, 3) -> (3, 3)

Note: “@” in Python is the symbol for matrix multiplication.
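Both rules can be checked quickly on tensors of ones. This is a minimal sketch; the names a and b are placeholders of mine.

```python
import torch

a = torch.ones(2, 3)
b = torch.ones(3, 2)

print((a @ b).shape)  # (2, 3) @ (3, 2) -> (2, 2)
print((b @ a).shape)  # (3, 2) @ (2, 3) -> (3, 3)

try:
    b @ b  # (3, 2) @ (3, 2): the inner dimensions 2 and 3 do not match
except RuntimeError as err:
    print("shape mismatch:", err)

# Transposing the second operand makes the inner dimensions match
print((b @ b.T).shape)  # (3, 2) @ (2, 3) -> (3, 3)
```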

More information about matrix multiplication can be found in the Matrix Multiplication section.

tensor1 = torch.randn(3, 4)
tensor2 = torch.randn(4)

torch.Size([3, 4])
result = torch.matmul(tensor1, tensor2)

Note: The difference between element-wise multiplication (multiply) and matrix multiplication (matmul) is the addition of values: matmul multiplies the matching elements and then sums the products.

  • matmul: matrix multiplication

  • multiply: element-wise multiplication

tensor = torch.tensor([1, 2, 3])

Element-wise matrix multiplication

tensor * tensor
tensor([1, 4, 9])

Matrix multiplication

torch.matmul(tensor, tensor)

You can also use the “@” symbol for matrix multiplication, though it is not recommended:

print(tensor @ tensor)

For two 1-D tensors like these, matrix multiplication is the dot product. Neural networks are full of matrix multiplications and dot products.
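To see where the "addition of values" comes in, the dot product can be rebuilt by hand from the element-wise product. A minimal sketch using the same [1, 2, 3] tensor:

```python
import torch

tensor = torch.tensor([1, 2, 3])

elementwise = tensor * tensor              # [1*1, 2*2, 3*3]
manual_dot = elementwise.sum()             # 1 + 4 + 9
matmul_dot = torch.matmul(tensor, tensor)  # multiply and sum in one step

print(elementwise)  # tensor([1, 4, 9])
print(manual_dot)   # tensor(14)
print(matmul_dot)   # tensor(14)
```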

For example, torch.nn.Linear() module (we’ll see this in action later on), also known as a feed-forward layer or fully connected layer, implements a matrix multiplication between an input x and a weights matrix A.

\[ y = x\cdot{A^T} + b \]
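As a rough check of this formula, the output of torch.nn.Linear can be compared against the same computation written out with @. The layer sizes, batch size, and seed below are arbitrary assumptions of mine.

```python
import torch

torch.manual_seed(42)

linear = torch.nn.Linear(in_features=2, out_features=3)  # weight A has shape (3, 2)
x = torch.randn(4, 2)                                     # a batch of 4 inputs

y_module = linear(x)                          # what the layer computes
y_manual = x @ linear.weight.T + linear.bias  # y = x . A^T + b written by hand

print(y_module.shape)                                  # one row of 3 outputs per input
print(torch.allclose(y_module, y_manual, atol=1e-6))   # the two results agree
```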

