Probability that a random sorting network works

Given $n$ inputs $x_0, \ldots, x_{n-1}$, we build a random sorting network with $m$ gates by repeatedly picking two variables $x_i, x_j$ with $i < j$ and adding a comparator gate that swaps them if $x_i > x_j$.

Question 1: For fixed $n$, how large must $m$ be for the network to sort correctly with probability $> \frac{1}{2}$?

We have at least the lower bound $m = \Omega(n^2 \log n)$, since an input that is correctly sorted except that each consecutive pair is swapped will take $\Theta(n^2 \log n^2)$ time for each such pair to be chosen as a comparator. Is that also the upper bound, possibly with more $\log n$ factors?

Question 2: Is there a distribution over comparator gates that achieves $m = \tilde{O}(n)$, perhaps by choosing close comparators with higher probability?
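For small $n$, Question 1 can be explored numerically. The following is a minimal sketch, not a definitive experiment: it draws $m$ comparators uniformly at random and checks the network on all $2^n$ bit vectors, relying on the 0-1 principle. The helper names (`apply_network`, `sorts_all_inputs`, `random_network`, `success_rate`) and the default trial count are illustrative assumptions, not part of the question.

```python
import itertools
import random

def apply_network(gates, values):
    """Apply comparators (i, j) with i < j: swap when values[i] > values[j]."""
    out = list(values)
    for i, j in gates:
        if out[i] > out[j]:
            out[i], out[j] = out[j], out[i]
    return out

def sorts_all_inputs(gates, n):
    """0-1 principle: it is enough to check all 2^n bit vectors."""
    return all(
        apply_network(gates, bits) == sorted(bits)
        for bits in itertools.product((0, 1), repeat=n)
    )

def random_network(n, m, rng):
    """m comparators, each a uniformly random pair i < j."""
    return [tuple(sorted(rng.sample(range(n), 2))) for _ in range(m)]

def success_rate(n, m, trials=200, seed=0):
    """Fraction of sampled networks with m gates that sort every input."""
    rng = random.Random(seed)
    hits = sum(
        sorts_all_inputs(random_network(n, m, rng), n) for _ in range(trials)
    )
    return hits / trials
```

Scanning `success_rate(n, m)` over $m$ for a few small $n$ gives a rough idea of where the probability crosses $\frac{1}{2}$; the exhaustive check costs $2^n$ per network, so this only works for small $n$.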

sorting-network

— Geoffrey Irving

I guess one gets an $O(n^3 \log^{O(1)} n)$ upper bound by looking at one input at a time and then applying a union bound, but that sounds far from tight.

— daniello

Idea for question 2: pick a sorting network of depth $O(\log^2 n)$. At each step, randomly choose one of the gates of the sorting network and perform that comparison. After $\tilde{O}(n)$ steps, all the gates in the first layer will have been applied. After another $\tilde{O}(n)$ steps, all the gates in the second layer will have been applied. If you can show that this is monotone (inserting extra comparators in the middle of a sorting network can't hurt), you get a solution with $\tilde{O}(n)$ comparisons in total on average. I'm not sure whether the monotonicity claim is plausible.

— DW

@DW: Monotonicity need not hold. Consider the sequences

$$\begin{array}{rcl} s & = & (x_1, x_2), (x_0, x_2), (x_0, x_1); \\ s' & = & (x_1, x_2), \mathbf{(x_0, x_1)}, (x_0, x_2), (x_0, x_1). \end{array}$$

Sequence $s$ works; sequence $s'$ doesn't (consider the input $(1, 0, 0)$). The idea is that $(x_0, x_2), (x_0, x_1)$ sorts every input except $(0, 1, 0)$. In $s$, the input $(0, 1, 0)$ can't reach $(x_0, x_2), (x_0, x_1)$. In $s'$, it can.

— Neal Young
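The counterexample above is small enough to check by brute force. Here is a quick sketch that does so; the function names `apply_network` and `failures` are mine, and comparators are written as index pairs `(i, j)` standing for $(x_i, x_j)$.

```python
import itertools

def apply_network(gates, values):
    """Apply comparators (i, j) with i < j: swap when values[i] > values[j]."""
    out = list(values)
    for i, j in gates:
        if out[i] > out[j]:
            out[i], out[j] = out[j], out[i]
    return out

def failures(gates, n=3):
    """All 0-1 inputs of length n that the comparator sequence fails to sort."""
    return [
        bits
        for bits in itertools.product((0, 1), repeat=n)
        if apply_network(gates, bits) != sorted(bits)
    ]

s = [(1, 2), (0, 2), (0, 1)]
s_prime = [(1, 2), (0, 1), (0, 2), (0, 1)]  # s with an extra (x_0, x_1) inserted
tail = [(0, 2), (0, 1)]                     # the common suffix of s and s'

print(failures(s))        # []
print(failures(s_prime))  # [(1, 0, 0)]
print(failures(tail))     # [(0, 1, 0)]
```

The extra comparator turns $(1, 0, 0)$ into $(0, 1, 0)$, which is exactly the one input the suffix cannot handle.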

Consider the variant where the network is built by choosing two adjacent variables $x_i, x_{i+1}$ at random in each step. Now monotonicity does hold (since adjacent swaps never create inversions). Apply @DW's idea to the odd-even sorting network, which has $n$ rounds: in odd rounds it compares all adjacent pairs where $i$ is odd, in even rounds it compares all adjacent pairs where $i$ is even. With high probability the random network is correct within $O(n^2\log n)$ comparisons, because it "includes" this network. (Or am I missing something?)

— Neal Young
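As a sanity check of this comment for small $n$, here is a sketch that builds the $n$-round odd-even transposition network and also counts how many uniformly random adjacent comparators are needed before every 0-1 input is sorted. The function names, the choice to start with an even round, and the uniform choice of $i$ are assumptions made for illustration.

```python
import itertools
import random

def odd_even_transposition(n):
    """n rounds; round r compares the adjacent pairs (i, i+1) with i = r mod 2."""
    gates = []
    for r in range(n):
        gates.extend((i, i + 1) for i in range(r % 2, n - 1, 2))
    return gates

def sorts_all_inputs(gates, n):
    """Brute-force 0-1 check of a comparator sequence."""
    for bits in itertools.product((0, 1), repeat=n):
        out = list(bits)
        for i, j in gates:
            if out[i] > out[j]:
                out[i], out[j] = out[j], out[i]
        if out != sorted(out):
            return False
    return True

def steps_until_sorting(n, rng):
    """Add random adjacent comparators until all 2^n bit vectors are sorted."""
    states = [list(b) for b in itertools.product((0, 1), repeat=n)]
    steps = 0
    while any(s != sorted(s) for s in states):
        i = rng.randrange(n - 1)  # comparator (x_i, x_{i+1})
        for s in states:
            if s[i] > s[i + 1]:
                s[i], s[i + 1] = s[i + 1], s[i]
        steps += 1
    return steps
```

Averaging `steps_until_sorting(n, rng)` over several seeds and comparing against $n^2 \log n$ is one way to see how loose the bound is in practice, though only for $n$ small enough that $2^n$ states fit in memory.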

Monotonicity of adjacent networks: Let $a, b\in\{0,1\}^n$. For $j\in\{0,1,\ldots,n\}$, define $s_j(a) = \sum_{i=1}^j a_i$. Say $a\preceq b$ if $s_j(a) \le s_j(b)$ ($\forall j$). Fix any comparison "$x_i < x_{i+1}$". Let $a'$ and $b'$ come from $a$ and $b$ by doing that comparison. Claim 1: $a' \preceq a$ and $b' \preceq b$. Claim 2: if $a\preceq b$, then $a' \preceq b'$. Then show inductively: if $y$ is the result of comparison sequence $s$ on input $x$, and $y'$ is the result of super-sequence $s'$ of $s$ on $x$, then $y' \preceq y$. So if $y$ is sorted, so is $y'$.

— Neal Young
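Claims 1 and 2 can be checked exhaustively for small $n$. This is only a sketch to build confidence, not a proof; the names `prefix_sums`, `preceq`, `compare_adjacent`, and `check_claims` are assumptions, and positions are 0-indexed in the code.

```python
import itertools

def prefix_sums(a):
    """s_1(a), ..., s_n(a); s_0 is always 0 and is omitted."""
    return list(itertools.accumulate(a))

def preceq(a, b):
    """a <= b in the prefix-sum order: s_j(a) <= s_j(b) for every j."""
    return all(x <= y for x, y in zip(prefix_sums(a), prefix_sums(b)))

def compare_adjacent(a, i):
    """Apply the comparator on positions i, i+1 (swap if out of order)."""
    a = list(a)
    if a[i] > a[i + 1]:
        a[i], a[i + 1] = a[i + 1], a[i]
    return tuple(a)

def check_claims(n):
    """Brute-force Claim 1 and Claim 2 over all bit vectors of length n."""
    vectors = list(itertools.product((0, 1), repeat=n))
    for i in range(n - 1):
        for a in vectors:
            a2 = compare_adjacent(a, i)
            if not preceq(a2, a):                    # Claim 1
                return False
            for b in vectors:
                if preceq(a, b) and not preceq(a2, compare_adjacent(b, i)):
                    return False                     # Claim 2 violated
    return True
```

Both claims come down to the fact that an adjacent comparator changes only one prefix sum, and changes it monotonically in terms of its two neighbours.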

Here's some empirical data for question 2, based on D.W.'s idea applied to bitonic sort. For $n$ variables, choose $j - i = 2^k$ with probability proportional to $\lg n - k$ , then select $i$ uniformly at random to get a comparator $(i,j)$ . This matches the distribution of comparators in bitonic sort if $n$ is a power of 2, and approximates it otherwise.

For a given infinite sequence of gates pulled from this distribution, we can approximate the number of gates required to get a sorting network by sorting many random bit sequences. Here's that estimate for $n < 200$ taking the mean over $100$ gate sequences with $6400$ bit sequences used to approximate the count: It appears to match $\Theta(n \log^2 n)$ , the same complexity as bitonic sort. If so, we don't eat an extra $\log n$ factor due to the coupon collector problem of coming across each gate.

To emphasize: I'm using only $6400$ bit sequences to approximate the expected number of gates, not $2^n$ . The mean required gates does rise with that number: for $n = 199$ if I use $6400$ , $64000$ , and $640000$ sequences the estimates are $14270 \pm 1069$ , $14353 \pm 1013$ , and $14539 \pm 965$ . Thus, it's possible getting the last few sequences increases the asymptotic complexity, though intuitively it feels unlikely.
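For readers who want to reproduce something like this, here is a rough sketch of the sampling procedure as described above. It is not the code used for the numbers in this answer. The function names, the handling of $n$ that is not a power of two (keeping every $k$ with $2^k < n$), and the use of Python's `random` module are all assumptions.

```python
import math
import random

def sample_comparator(n, rng):
    """Pick j - i = 2^k with probability proportional to lg n - k, then i uniformly."""
    ks = [k for k in range(n.bit_length()) if (1 << k) < n]
    weights = [math.log2(n) - k for k in ks]  # positive because 2^k < n
    d = 1 << rng.choices(ks, weights=weights)[0]
    i = rng.randrange(n - d)
    return i, i + d

def gates_until_sample_sorted(n, num_inputs, rng):
    """Number of random gates until num_inputs random bit sequences are all sorted.

    This is only a lower estimate of the gates needed for a true sorting
    network, since it does not check all 2^n inputs.
    """
    inputs = [[rng.randrange(2) for _ in range(n)] for _ in range(num_inputs)]
    unsorted = [x for x in inputs if x != sorted(x)]
    count = 0
    while unsorted:
        i, j = sample_comparator(n, rng)
        for x in unsorted:
            if x[i] > x[j]:
                x[i], x[j] = x[j], x[i]
        # A sorted input is never changed by a comparator, so it can be dropped.
        unsorted = [x for x in unsorted if x != sorted(x)]
        count += 1
    return count
```

Averaging `gates_until_sample_sorted(n, 6400, rng)` over many seeds and dividing by $n \lg^2 n$ gives a ratio whose behaviour as $n$ grows can then be inspected.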

Edit: Here's a similar plot up to $n = 80$, but using the exact number of gates (computed via a combination of sampling and Z3). I've switched from powers of two to arbitrary $d = j-i$ with $d \in [1,\frac{n}{2}]$, chosen with probability proportional to $\frac{\log n - \log d}{d}$. $\Theta(n \log^2 n)$ still looks plausible.

— Geoffrey Irving

Nice experiment! There's a different way the coupon collector problem might come up here, though: you're sampling only a small fraction of the $2^n$ bit sequences needed to verify correctness on all inputs. It seems we can conclude (scientifically, not mathematically) from your tests that a random network of this size and type sorts a random permutation whp. I'd still be curious to see the exhaustive $2^n$ tests on such random networks for all $n$ up to wherever you're willing to go ($n=20$ shouldn't be too bad, maybe even $n=30$ depending on the language and hardware you use).

— Joshua Grochow

It looks the same for exact up to $n = 27$, but I don't view that as conclusive.

— Geoffrey Irving

@JoshuaGrochow: I've added exact values up to $n = 80$.

— Geoffrey Irving

Nice! There does appear to be a growing spread to the exact data though, which perhaps indicates an upper bound with an extra factor of $\log n$? (That is, if the "spread" is growing at a rate of $\log n$.)

— Joshua Grochow

Yeah, we can't rule out an extra factor. I'd be surprised if it was $\log n$, though, since up at 80 we have $\lg n \approx 6$ and the constant is suspiciously close to $1$ otherwise. At this point I think theory has to take over. :)

— Geoffrey Irving