วิธีเพิ่มคอลัมน์พิเศษลงในอาร์เรย์ NumPy

292

สมมติว่าฉันมีอาร์เรย์ NumPy a:

a = np.array([
    [1, 2, 3],
    [2, 3, 4]
    ])

และฉันต้องการเพิ่มคอลัมน์ศูนย์เพื่อรับอาร์เรย์b:

b = np.array([
    [1, 2, 3, 0],
    [2, 3, 4, 0]
    ])

ฉันจะทำสิ่งนี้ได้อย่างง่ายดายใน NumPy

python numpy

— Peter Smit
แหล่งที่มา

181

ฉันคิดว่าทางออกที่ตรงไปตรงมามากขึ้นและการบูตเร็วกว่าก็คือทำสิ่งต่อไปนี้:

import numpy as np
N = 10
a = np.random.rand(N,N)
b = np.zeros((N,N+1))
b[:,:-1] = a

และกำหนดเวลา:

In [23]: N = 10

In [24]: a = np.random.rand(N,N)

In [25]: %timeit b = np.hstack((a,np.zeros((a.shape[0],1))))
10000 loops, best of 3: 19.6 us per loop

In [27]: %timeit b = np.zeros((a.shape[0],a.shape[1]+1)); b[:,:-1] = a
100000 loops, best of 3: 5.62 us per loop

— JoshAdel
แหล่งที่มา

16

ฉันต้องการผนวก (985,1) รูปร่าง np araay ไปยัง (985,2) np array เพื่อให้อาร์เรย์ np (985,3) np แต่มันไม่ทำงาน ฉันได้รับ "ไม่สามารถออกอากาศอาเรย์อินพุตจากรูปร่าง (985) เป็นรูปร่าง (985,1)" ข้อผิดพลาด เกิดอะไรขึ้นกับรหัสของฉัน รหัส: np.hstack (data, data1)

— Outlier

5

@ ก่อนหน้าคุณควรโพสต์คำถามใหม่แทนที่จะถามคำถามในความคิดเห็นของคำถามนี้

— JoshAdel

4

@JoshAdel: ฉันลองรหัสของคุณบน ipython และฉันคิดว่ามีข้อผิดพลาดทางไวยากรณ์ คุณอาจต้องการลองเปลี่ยนa = np.random.rand((N,N))เป็นa = np.random.rand(N,N)

— hlin117

ฉันเดาว่านี่เป็น overkill สำหรับสิ่งที่ OP ขอ คำตอบของสหกรณ์คือ apt!

— lft93ryt

นี่เป็นเพียงกลลวงในการเพิ่มผนวกหรือแทรกหรือสแต็ก และไม่ควรได้รับการยอมรับว่าเป็นคำตอบ วิศวกรควรพิจารณาใช้คำตอบด้านล่าง

— cinqS

326

np.r_[ ... ]และnp.c_[ ... ] เป็นทางเลือกที่มีประโยชน์กับvstackและhstackโดยมีเครื่องหมายวงเล็บสี่เหลี่ยม [] แทน round ()
ตัวอย่างสองตัวอย่าง:

: import numpy as np
: N = 3
: A = np.eye(N)

: np.c_[ A, np.ones(N) ]              # add a column
array([[ 1.,  0.,  0.,  1.],
       [ 0.,  1.,  0.,  1.],
       [ 0.,  0.,  1.,  1.]])

: np.c_[ np.ones(N), A, np.ones(N) ]  # or two
array([[ 1.,  1.,  0.,  0.,  1.],
       [ 1.,  0.,  1.,  0.,  1.],
       [ 1.,  0.,  0.,  1.,  1.]])

: np.r_[ A, [A[1]] ]              # add a row
array([[ 1.,  0.,  0.],
       [ 0.,  1.,  0.],
       [ 0.,  0.,  1.],
       [ 0.,  1.,  0.]])
: # not np.r_[ A, A[1] ]

: np.r_[ A[0], 1, 2, 3, A[1] ]    # mix vecs and scalars
  array([ 1.,  0.,  0.,  1.,  2.,  3.,  0.,  1.,  0.])

: np.r_[ A[0], [1, 2, 3], A[1] ]  # lists
  array([ 1.,  0.,  0.,  1.,  2.,  3.,  0.,  1.,  0.])

: np.r_[ A[0], (1, 2, 3), A[1] ]  # tuples
  array([ 1.,  0.,  0.,  1.,  2.,  3.,  0.,  1.,  0.])

: np.r_[ A[0], 1:4, A[1] ]        # same, 1:4 == arange(1,4) == 1,2,3
  array([ 1.,  0.,  0.,  1.,  2.,  3.,  0.,  1.,  0.])

(เหตุผลสำหรับวงเล็บเหลี่ยม [] แทนที่จะเป็นกลม () คือ Python ขยายตัวอย่างเช่น 1: 4 ในรูปสี่เหลี่ยมจัตุรัส - สิ่งมหัศจรรย์ของการบรรทุกเกินพิกัด)

— เดนิส
แหล่งที่มา

7

เพิ่งหาข้อมูลเกี่ยวกับสิ่งนี้และแน่นอนว่านี่เป็นคำตอบที่ดีกว่าคำตอบที่ยอมรับเพราะมันครอบคลุมการเพิ่มคอลัมน์พิเศษที่จุดเริ่มต้นและท้ายที่สุดไม่ใช่แค่ตอนท้ายเหมือนคำตอบอื่น ๆ

— Ay0

2

@ Ay0 แน่นอนฉันกำลังมองหาวิธีเพิ่มหน่วยอคติในเครือข่ายเซลล์ประสาทเทียมของฉันเป็นชุดในทุกชั้นในคราวเดียวและนี่คือคำตอบที่สมบูรณ์แบบ

— gaborous

และสิ่งที่ถ้าคุณต้องการเพิ่มnคอลัมน์ในเวลาหรือไม่

— Riley

1

@Riley คุณช่วยยกตัวอย่างได้ไหม Python 3 มี "การแกะกล่องซ้ำได้" เช่นnp.c_[ * iterable ]; เห็นการแสดงออกของรายการ

— เดนิส

@ เดนิสนั่นคือสิ่งที่ฉันกำลังมองหา!

— Riley

148

การใช้numpy.append:

>>> a = np.array([[1,2,3],[2,3,4]])
>>> a
array([[1, 2, 3],
       [2, 3, 4]])

>>> z = np.zeros((2,1), dtype=int64)
>>> z
array([[0],
       [0]])

>>> np.append(a, z, axis=1)
array([[1, 2, 3, 0],
       [2, 3, 4, 0]])

— gemy
แหล่งที่มา

3

นี่เป็นสิ่งที่ดีเมื่อแทรกคอลัมน์ที่ซับซ้อนมากขึ้น

— โทมัส Ahle

6

นี่ตรงไปตรงกว่าคำตอบของ @JoshAdel แต่เมื่อต้องรับมือกับชุดข้อมูลขนาดใหญ่มันจะช้ากว่า ฉันเลือกระหว่างสองสิ่งนี้ขึ้นอยู่กับความสำคัญของการอ่านได้

— dvj

3

appendจริง ๆ แล้วแค่โทรconcatenate

— rll

53

วิธีหนึ่งในการใช้hstackคือ:

b = np.hstack((a, np.zeros((a.shape[0], 1), dtype=a.dtype)))

— Peter Smit
แหล่งที่มา

2

ฉันคิดว่านี่เป็นทางออกที่ดีที่สุด

— silvado

2

+1 - นี่คือวิธีที่ฉันจะทำ - คุณเอาชนะฉันเพื่อโพสต์เป็นคำตอบ :)

— แบลร์

3

ลบdtypeพารามิเตอร์มันไม่จำเป็นและไม่อนุญาต ในขณะที่โซลูชันของคุณหรูหราเพียงพออย่าใช้มันหากคุณต้องการ "ผนวก" บ่อยๆกับอาเรย์ หากคุณไม่สามารถสร้างอาร์เรย์ทั้งหมดในครั้งเดียวและเติมในภายหลังสร้างรายการของอาร์เรย์และhstackทั้งหมดในครั้งเดียว

— eumiro

1

@ eumiro ฉันไม่แน่ใจว่าฉันจัดการเพื่อให้ได้ dtype ในตำแหน่งที่ไม่ถูกต้องได้อย่างไร แต่ np.zeros ต้องการ dtype เพื่อหลีกเลี่ยงทุกสิ่งที่กำลังลอย (ในขณะที่ int)

— Peter Smit

42

ฉันพบว่าสง่างามที่สุดต่อไปนี้:

b = np.insert(a, 3, values=0, axis=1) # Insert values before column 3

ข้อได้เปรียบinsertคือมันช่วยให้คุณสามารถแทรกคอลัมน์ (หรือแถว) ในที่อื่น ๆ ภายในอาร์เรย์ นอกจากนี้แทนที่จะใส่ค่าเดียวคุณสามารถแทรกทั้งเวกเตอร์ได้อย่างง่ายดายเช่นทำซ้ำคอลัมน์สุดท้าย:

b = np.insert(a, insert_index, values=a[:,2], axis=1)

ซึ่งนำไปสู่:

array([[1, 2, 3, 3],
       [2, 3, 4, 4]])

สำหรับช่วงเวลานั้นinsertอาจช้ากว่าโซลูชันของ JoshAdel:

In [1]: N = 10

In [2]: a = np.random.rand(N,N)

In [3]: %timeit b = np.hstack((a, np.zeros((a.shape[0], 1))))
100000 loops, best of 3: 7.5 µs per loop

In [4]: %timeit b = np.zeros((a.shape[0], a.shape[1]+1)); b[:,:-1] = a
100000 loops, best of 3: 2.17 µs per loop

In [5]: %timeit b = np.insert(a, 3, values=0, axis=1)
100000 loops, best of 3: 10.2 µs per loop

— Björn
แหล่งที่มา

1

อันนี้ค่อนข้างเรียบร้อย เสียดายที่ฉันไม่สามารถinsert(a, -1, ...)เพิ่มคอลัมน์ต่อท้ายได้ เดาว่าฉันจะเติมมันแทน

— โทมัส Ahle

2

@ThomasAhle a.shape[axis]คุณสามารถผนวกแถวหรือคอลัมน์โดยได้รับขนาดในแกนว่าการใช้ I. e. สำหรับท้ายแถวที่คุณทำและคอลัมน์ที่คุณทำnp.insert(a, a.shape[0], 999, axis=0) np.insert(a, a.shape[1], 999, axis=1)

— blubberdiblub

35

ฉันสนใจคำถามนี้และเปรียบเทียบความเร็วของ

numpy.c_[a, a]
numpy.stack([a, a]).T
numpy.vstack([a, a]).T
numpy.ascontiguousarray(numpy.stack([a, a]).T)               
numpy.ascontiguousarray(numpy.vstack([a, a]).T)
numpy.column_stack([a, a])
numpy.concatenate([a[:,None], a[:,None]], axis=1)
numpy.concatenate([a[None], a[None]], axis=0).T

aซึ่งทุกคนทำในสิ่งเดียวกันสำหรับเวกเตอร์ป้อนข้อมูลใด ๆ กำหนดเวลาสำหรับการเติบโตa:

โปรดทราบว่าในที่สุดตัวแปรที่ไม่ต่อเนื่องทั้งหมด(โดยเฉพาะอย่างยิ่ง stack/ vstack) จะเร็วกว่าตัวแปรที่ต่อเนื่องกันทั้งหมด column_stack(สำหรับความชัดเจนและความเร็ว) ดูเหมือนจะเป็นตัวเลือกที่ดีหากคุณต้องการความต่อเนื่อง

รหัสในการทำซ้ำพล็อต:

import numpy
import perfplot

perfplot.save(
    "out.png",
    setup=lambda n: numpy.random.rand(n),
    kernels=[
        lambda a: numpy.c_[a, a],
        lambda a: numpy.ascontiguousarray(numpy.stack([a, a]).T),
        lambda a: numpy.ascontiguousarray(numpy.vstack([a, a]).T),
        lambda a: numpy.column_stack([a, a]),
        lambda a: numpy.concatenate([a[:, None], a[:, None]], axis=1),
        lambda a: numpy.ascontiguousarray(
            numpy.concatenate([a[None], a[None]], axis=0).T
        ),
        lambda a: numpy.stack([a, a]).T,
        lambda a: numpy.vstack([a, a]).T,
        lambda a: numpy.concatenate([a[None], a[None]], axis=0).T,
    ],
    labels=[
        "c_",
        "ascont(stack)",
        "ascont(vstack)",
        "column_stack",
        "concat",
        "ascont(concat)",
        "stack (non-cont)",
        "vstack (non-cont)",
        "concat (non-cont)",
    ],
    n_range=[2 ** k for k in range(20)],
    xlabel="len(a)",
    logx=True,
    logy=True,
)

— Nico Schlömer
แหล่งที่มา

1

กราฟที่ดี! แค่คิดว่าคุณต้องการที่จะรู้ว่าภายใต้ประทุน, stack, hstack, vstack, column_stack, มีฟังก์ชั่นช่วยที่สร้างขึ้นบนdstack np.concatenateโดยการติดตามผ่านนิยามของสแต็คก็พบว่ามีการโทรnp.stack([a,a]) np.concatenate([a[None], a[None]], axis=0)มันอาจเป็นการดีที่จะเพิ่มnp.concatenate([a[None], a[None]], axis=0).Tไปยัง perfplot เพื่อแสดงว่าnp.concatenateอย่างน้อยก็เร็วเท่าฟังก์ชั่นผู้ช่วย

— unutbu

@unutbu เพิ่มว่า

— Nico Schlömer

ห้องสมุดที่ดีไม่เคยได้ยินมัน! น่าสนใจพอที่ฉันได้แค่แปลงเดียวกันยกเว้นว่า stack และ concat มีการเปลี่ยนสถานที่ (ทั้งในรูปแบบ ascont และ non-cont) บวก concat-column และ column_stack สลับกันเช่นกัน

— Antony Hatchkins

1

ว้าวรักแปลงเหล่านี้!

— jhegedus

ดูเหมือนว่าสำหรับการดำเนินการแบบเรียกซ้ำของการผนวกคอลัมน์ไปยังอาร์เรย์เช่น b = [b, a] คำสั่งบางคำสั่งไม่ทำงาน (ข้อผิดพลาดเกี่ยวกับมิติที่ไม่เท่ากันถูกยกขึ้น) มีเพียงสองคนเท่านั้นที่ดูเหมือนจะทำงานกับอาร์เรย์ที่มีขนาดไม่เท่ากัน (เช่นเมื่อหนึ่งคือเมทริกซ์และอีกหนึ่งเป็นเวกเตอร์ 1d) เป็นc_และcolumn_stack

— Confounded

29

ฉันคิด:

np.column_stack((a, zeros(shape(a)[0])))

สง่างามมากขึ้น

— user2820502
แหล่งที่มา

12

np.concatenateยังใช้งานได้

>>> a = np.array([[1,2,3],[2,3,4]])
>>> a
array([[1, 2, 3],
       [2, 3, 4]])
>>> z = np.zeros((2,1))
>>> z
array([[ 0.],
       [ 0.]])
>>> np.concatenate((a, z), axis=1)
array([[ 1.,  2.,  3.,  0.],
       [ 2.,  3.,  4.,  0.]])

— han4wluc
แหล่งที่มา

np.concatenateดูเหมือนว่าจะเร็วกว่า 3 เท่าnp.hstackสำหรับเมทริกซ์ 2x1, 2x2 และ 2x3 np.concatenateก็เร็วกว่าการทำสำเนาเมทริกซ์ด้วยตนเองลงในเมทริกซ์เปล่าในการทดลองของฉัน ที่สอดคล้องกับคำตอบของ Nico Schlömerด้านล่าง

— Lenar Hoyt

11

สมมติว่าMเป็น (100,3) ndarray และyเป็น (100,) ndarray appendสามารถใช้ดังนี้:

M=numpy.append(M,y[:,None],1)

เคล็ดลับคือการใช้

y[:, None]

สิ่งนี้จะแปลงyเป็นอาร์เรย์ 2 มิติ (100, 1)

M.shape

ตอนนี้ให้

(100, 4)

— Roel Verhoeven
แหล่งที่มา

คุณเป็นฮีโร่ที่คุณรู้หรือไม่! นั่นคือสิ่งที่ฉันดึงผมตลอด 1 ชั่วโมงที่ผ่านมา! ไท!

— John Doe

8

ฉันชอบคำตอบของ JoshAdel เพราะมุ่งเน้นไปที่ประสิทธิภาพ การปรับปรุงประสิทธิภาพเล็กน้อยคือการหลีกเลี่ยงค่าใช้จ่ายในการเริ่มต้นด้วยเลขศูนย์เท่านั้นที่จะถูกเขียนทับ สิ่งนี้มีความแตกต่างที่วัดได้เมื่อ N มีขนาดใหญ่ว่างถูกใช้แทนค่าศูนย์และคอลัมน์ของศูนย์จะถูกเขียนเป็นขั้นตอนแยกต่างหาก:

In [1]: import numpy as np

In [2]: N = 10000

In [3]: a = np.ones((N,N))

In [4]: %timeit b = np.zeros((a.shape[0],a.shape[1]+1)); b[:,:-1] = a
1 loops, best of 3: 492 ms per loop

In [5]: %timeit b = np.empty((a.shape[0],a.shape[1]+1)); b[:,:-1] = a; b[:,-1] = np.zeros((a.shape[0],))
1 loops, best of 3: 407 ms per loop

— toddInPortland
แหล่งที่มา

คุณสามารถใช้การกระจายเสียงเพื่อเติมเต็มคอลัมน์สุดท้ายด้วยศูนย์ (หรือค่าอื่น ๆ ) b[:,-1] = 0ซึ่งอาจจะสามารถอ่านเพิ่มเติมได้ที่: ยิ่งไปกว่านั้นด้วยอาร์เรย์ที่มีขนาดใหญ่มากความแตกต่างของประสิทธิภาพการทำงานnp.insert()จะเล็กน้อยซึ่งอาจทำให้np.insert()เป็นที่ต้องการมากขึ้นเนื่องจากความรัดกุม

— blubberdiblub

7

np.insert ยังทำหน้าที่วัตถุประสงค์

matA = np.array([[1,2,3], 
                 [2,3,4]])
idx = 3
new_col = np.array([0, 0])
np.insert(matA, idx, new_col, axis=1)

array([[1, 2, 3, 0],
       [2, 3, 4, 0]])

มันแทรกค่าที่นี่new_colก่อนที่ดัชนีที่กำหนดที่นี่idxพร้อมหนึ่งแกน กล่าวอีกนัยหนึ่งค่าที่แทรกใหม่จะครอบครองidxคอลัมน์และย้ายสิ่งที่มีอยู่เดิมที่และidxหลัง

— ไท
แหล่งที่มา

1

โปรดทราบว่าไม่ได้insertอยู่ในตำแหน่งที่ใคร ๆ จะถือว่าได้รับชื่อของฟังก์ชัน (ดูเอกสารที่ลิงก์ในคำตอบ)

— jneuendorf

5

เพิ่มคอลัมน์พิเศษลงในอาร์เรย์ที่มีค่า:

np.appendวิธีการของนัมมี่ใช้พารามิเตอร์สามตัวโดยสองตัวแรกคืออาเรย์ numpy สองมิติและที่สามคือพารามิเตอร์แกนที่สั่งให้พร้อมที่จะผนวกแกน:

import numpy as np  
x = np.array([[1,2,3], [4,5,6]]) 
print("Original x:") 
print(x) 

y = np.array([[1], [1]]) 
print("Original y:") 
print(y) 

print("x appended to y on axis of 1:") 
print(np.append(x, y, axis=1))

พิมพ์:

Original x:
[[1 2 3]
 [4 5 6]]
Original y:
[[1]
 [1]]
x appended to y on axis of 1:
[[1 2 3 1]
 [4 5 6 1]]

— ฮัสซันบาฮาลู
แหล่งที่มา

โปรดทราบว่าคุณกำลังผนวก y กับ x ที่นี่แทนที่จะเพิ่ม x ถึง y นั่นคือเหตุผลที่เวกเตอร์คอลัมน์ของ y อยู่ทางขวาของคอลัมน์ของ x ในผลลัพธ์

— Brian Popeck

4

สายไปงานเลี้ยงเล็กน้อย แต่ยังไม่มีใครโพสต์คำตอบนี้ดังนั้นเพื่อความสมบูรณ์: คุณสามารถทำได้ด้วย list comprehensions ใน Python array:

source = a.tolist()
result = [row + [0] for row in source]
b = np.array(result)

— BTK
แหล่งที่มา

4

สำหรับฉันวิธีถัดไปดูสวยใช้งานง่ายและง่าย

zeros = np.zeros((2,1)) #2 is a number of rows in your array.   
b = np.hstack((a, zeros))

— ชิมอนเอส
แหล่งที่มา

3

ในกรณีของฉันฉันต้องเพิ่มคอลัมน์ของคอลัมน์ในอาร์เรย์ NumPy

X = array([ 6.1101, 5.5277, ... ])
X.shape => (97,)
X = np.concatenate((np.ones((m,1), dtype=np.int), X.reshape(m,1)), axis=1)

หลังจาก X.shape => (97, 2)

array([[ 1. , 6.1101],
       [ 1. , 5.5277],
...

— Mircea Stanciu
แหล่งที่มา

1

มีฟังก์ชั่นเฉพาะสำหรับเรื่องนี้ มันเรียกว่า numpy.pad

a = np.array([[1,2,3], [2,3,4]])
b = np.pad(a, ((0, 0), (0, 1)), mode='constant', constant_values=0)
print b
>>> array([[1, 2, 3, 0],
           [2, 3, 4, 0]])

นี่คือสิ่งที่กล่าวใน docstring:

Pads an array.

Parameters
----------
array : array_like of rank N
    Input array
pad_width : {sequence, array_like, int}
    Number of values padded to the edges of each axis.
    ((before_1, after_1), ... (before_N, after_N)) unique pad widths
    for each axis.
    ((before, after),) yields same before and after pad for each axis.
    (pad,) or int is a shortcut for before = after = pad width for all
    axes.
mode : str or function
    One of the following string values or a user supplied function.

    'constant'
        Pads with a constant value.
    'edge'
        Pads with the edge values of array.
    'linear_ramp'
        Pads with the linear ramp between end_value and the
        array edge value.
    'maximum'
        Pads with the maximum value of all or part of the
        vector along each axis.
    'mean'
        Pads with the mean value of all or part of the
        vector along each axis.
    'median'
        Pads with the median value of all or part of the
        vector along each axis.
    'minimum'
        Pads with the minimum value of all or part of the
        vector along each axis.
    'reflect'
        Pads with the reflection of the vector mirrored on
        the first and last values of the vector along each
        axis.
    'symmetric'
        Pads with the reflection of the vector mirrored
        along the edge of the array.
    'wrap'
        Pads with the wrap of the vector along the axis.
        The first values are used to pad the end and the
        end values are used to pad the beginning.
    <function>
        Padding function, see Notes.
stat_length : sequence or int, optional
    Used in 'maximum', 'mean', 'median', and 'minimum'.  Number of
    values at edge of each axis used to calculate the statistic value.

    ((before_1, after_1), ... (before_N, after_N)) unique statistic
    lengths for each axis.

    ((before, after),) yields same before and after statistic lengths
    for each axis.

    (stat_length,) or int is a shortcut for before = after = statistic
    length for all axes.

    Default is ``None``, to use the entire axis.
constant_values : sequence or int, optional
    Used in 'constant'.  The values to set the padded values for each
    axis.

    ((before_1, after_1), ... (before_N, after_N)) unique pad constants
    for each axis.

    ((before, after),) yields same before and after constants for each
    axis.

    (constant,) or int is a shortcut for before = after = constant for
    all axes.

    Default is 0.
end_values : sequence or int, optional
    Used in 'linear_ramp'.  The values used for the ending value of the
    linear_ramp and that will form the edge of the padded array.

    ((before_1, after_1), ... (before_N, after_N)) unique end values
    for each axis.

    ((before, after),) yields same before and after end values for each
    axis.

    (constant,) or int is a shortcut for before = after = end value for
    all axes.

    Default is 0.
reflect_type : {'even', 'odd'}, optional
    Used in 'reflect', and 'symmetric'.  The 'even' style is the
    default with an unaltered reflection around the edge value.  For
    the 'odd' style, the extented part of the array is created by
    subtracting the reflected values from two times the edge value.

Returns
-------
pad : ndarray
    Padded array of rank equal to `array` with shape increased
    according to `pad_width`.

Notes
-----
.. versionadded:: 1.7.0

For an array with rank greater than 1, some of the padding of later
axes is calculated from padding of previous axes.  This is easiest to
think about with a rank 2 array where the corners of the padded array
are calculated by using padded values from the first axis.

The padding function, if used, should return a rank 1 array equal in
length to the vector argument with padded values replaced. It has the
following signature::

    padding_func(vector, iaxis_pad_width, iaxis, kwargs)

where

    vector : ndarray
        A rank 1 array already padded with zeros.  Padded values are
        vector[:pad_tuple[0]] and vector[-pad_tuple[1]:].
    iaxis_pad_width : tuple
        A 2-tuple of ints, iaxis_pad_width[0] represents the number of
        values padded at the beginning of vector where
        iaxis_pad_width[1] represents the number of values padded at
        the end of vector.
    iaxis : int
        The axis currently being calculated.
    kwargs : dict
        Any keyword arguments the function requires.

Examples
--------
>>> a = [1, 2, 3, 4, 5]
>>> np.pad(a, (2,3), 'constant', constant_values=(4, 6))
array([4, 4, 1, 2, 3, 4, 5, 6, 6, 6])

>>> np.pad(a, (2, 3), 'edge')
array([1, 1, 1, 2, 3, 4, 5, 5, 5, 5])

>>> np.pad(a, (2, 3), 'linear_ramp', end_values=(5, -4))
array([ 5,  3,  1,  2,  3,  4,  5,  2, -1, -4])

>>> np.pad(a, (2,), 'maximum')
array([5, 5, 1, 2, 3, 4, 5, 5, 5])

>>> np.pad(a, (2,), 'mean')
array([3, 3, 1, 2, 3, 4, 5, 3, 3])

>>> np.pad(a, (2,), 'median')
array([3, 3, 1, 2, 3, 4, 5, 3, 3])

>>> a = [[1, 2], [3, 4]]
>>> np.pad(a, ((3, 2), (2, 3)), 'minimum')
array([[1, 1, 1, 2, 1, 1, 1],
       [1, 1, 1, 2, 1, 1, 1],
       [1, 1, 1, 2, 1, 1, 1],
       [1, 1, 1, 2, 1, 1, 1],
       [3, 3, 3, 4, 3, 3, 3],
       [1, 1, 1, 2, 1, 1, 1],
       [1, 1, 1, 2, 1, 1, 1]])

>>> a = [1, 2, 3, 4, 5]
>>> np.pad(a, (2, 3), 'reflect')
array([3, 2, 1, 2, 3, 4, 5, 4, 3, 2])

>>> np.pad(a, (2, 3), 'reflect', reflect_type='odd')
array([-1,  0,  1,  2,  3,  4,  5,  6,  7,  8])

>>> np.pad(a, (2, 3), 'symmetric')
array([2, 1, 1, 2, 3, 4, 5, 5, 4, 3])

>>> np.pad(a, (2, 3), 'symmetric', reflect_type='odd')
array([0, 1, 1, 2, 3, 4, 5, 5, 6, 7])

>>> np.pad(a, (2, 3), 'wrap')
array([4, 5, 1, 2, 3, 4, 5, 1, 2, 3])

>>> def pad_with(vector, pad_width, iaxis, kwargs):
...     pad_value = kwargs.get('padder', 10)
...     vector[:pad_width[0]] = pad_value
...     vector[-pad_width[1]:] = pad_value
...     return vector
>>> a = np.arange(6)
>>> a = a.reshape((2, 3))
>>> np.pad(a, 2, pad_with)
array([[10, 10, 10, 10, 10, 10, 10],
       [10, 10, 10, 10, 10, 10, 10],
       [10, 10,  0,  1,  2, 10, 10],
       [10, 10,  3,  4,  5, 10, 10],
       [10, 10, 10, 10, 10, 10, 10],
       [10, 10, 10, 10, 10, 10, 10]])
>>> np.pad(a, 2, pad_with, padder=100)
array([[100, 100, 100, 100, 100, 100, 100],
       [100, 100, 100, 100, 100, 100, 100],
       [100, 100,   0,   1,   2, 100, 100],
       [100, 100,   3,   4,   5, 100, 100],
       [100, 100, 100, 100, 100, 100, 100],
       [100, 100, 100, 100, 100, 100, 100]])

— Ivan Hoffmann
แหล่งที่มา