OR-circuit complexity of dense linear operators

Consider the following simple monotone circuit model: each gate is just a binary OR. What is the complexity of a function $f(x)=Ax$, where $A$ is a Boolean $n \times n$ matrix with $O(n)$ 0's? Can it be computed by linear-size OR circuits?

More formally, $f$ is a function from $n$ to $n$ bits. The $i$-th output of $f$ is $\bigvee_{j=1}^{n}(A_{ij} \land x_j)$ (i.e., an OR of the subset of the input bits given by the $i$-th row of $A$).

Note that $O(n)$ 0's split the rows of $A$ into $O(n)$ ranges (subsets consisting of consecutive elements of $[n]$). This makes it possible to use known range query data structures. E.g., a sparse table data structure can be turned into an OR circuit of size $O(n\log n)$. Yao's algorithm for range semigroup operator queries can be turned into an almost linear circuit (of size $O(\alpha(n) \cdot n)$, where $\alpha(n)$ is the inverse Ackermann function).
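To make the sparse-table remark concrete, here is a minimal Python sketch. The function names (`build_sparse_table`, `range_or`, `gate_count`) are my own, not from the post. Every table entry above level 0 corresponds to one binary OR gate, which gives roughly $n\log n$ gates, and each range then costs one more gate.

```python
from typing import List


def build_sparse_table(x: List[int]) -> List[List[int]]:
    """table[k][i] is the OR of x[i .. i + 2**k - 1].

    Level 0 is the input itself; every entry on a higher level
    is one binary OR of two entries from the level below.
    """
    n = len(x)
    table = [list(x)]
    k = 1
    while (1 << k) <= n:
        prev = table[-1]
        half = 1 << (k - 1)
        table.append([prev[i] | prev[i + half] for i in range(n - (1 << k) + 1)])
        k += 1
    return table


def range_or(table: List[List[int]], lo: int, hi: int) -> int:
    """OR of x[lo..hi] (inclusive, lo <= hi) from two overlapping power-of-two blocks."""
    k = (hi - lo + 1).bit_length() - 1
    return table[k][lo] | table[k][hi - (1 << k) + 1]


def gate_count(table: List[List[int]]) -> int:
    """Number of binary OR gates used to build the table (levels 1 and up)."""
    return sum(len(level) for level in table[1:])
```

Overlapping blocks are harmless here because OR is idempotent, which is what makes the two-block query valid.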

In particular, I do not even know how to construct a linear-size circuit for the special case when each row of $A$ contains two zeros, whereas the case of one zero in each row is easy. (Each output function can be computed as an OR of a prefix $[1..k-1]$ and a suffix $[k+1..n]$, which can be precomputed by $2n$ OR gates.)
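The easy one-zero case can be sketched as follows. This is only an illustration with hypothetical names (`one_zero_per_row`, `naive`), using 0-based column indices: all prefix and suffix ORs are computed once (about $2n$ binary ORs), and each output then needs one more OR.

```python
from typing import List


def one_zero_per_row(zero_pos: List[int], x: List[int]) -> List[int]:
    """Row i of A is all ones except for a single 0 in column zero_pos[i].

    Returns y = Ax over the Boolean semiring.
    """
    n = len(x)
    prefix = [0] * (n + 1)  # prefix[k] = x[0] | ... | x[k-1]
    suffix = [0] * (n + 1)  # suffix[k] = x[k] | ... | x[n-1]
    for k in range(n):
        prefix[k + 1] = prefix[k] | x[k]
    for k in range(n - 1, -1, -1):
        suffix[k] = suffix[k + 1] | x[k]
    # Output i skips column k = zero_pos[i]: OR of x[0..k-1] and x[k+1..n-1].
    return [prefix[k] | suffix[k + 1] for k in zero_pos]


def naive(zero_pos: List[int], x: List[int]) -> List[int]:
    """Direct evaluation, used only to cross-check the circuit-style version."""
    return [int(any(x[j] for j in range(len(x)) if j != k)) for k in zero_pos]
```

For instance, with `x = [0, 0, 1, 0]` and `zero_pos = [2, 0, 3, 1]`, both functions return `[0, 1, 1, 1]`.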

ds.algorithms circuit-complexity upper-bounds

— Alexander S. Kulikov

One upper bound is known: it is at most $rk(A)$ times $n$ divided by $\log n$, where $rk(A)$ is the OR-rank of the boolean matrix $A$ (= the minimum number of all-1 submatrices whose OR coincides with $A$). See Lemma 2.5 in this book. So, how large can the boolean rank of $n\times n$ matrices with $O(n)$ zeros be?

— Stasys

@Stasys Thanks, Stasys! And what about the matrix with zeros on the diagonal? Its OR-rank is linear, isn't it?

— Alexander S. Kulikov

The OR-rank of your matrix (zeros on the diagonal and 1s elsewhere) is at most $2\log n$: label rows/columns by binary strings of length $\log n$, and consider the rectangles $\{(r,c): r(i)=a,\ c(i)=1-a\}$ for $a=0,1$ and each bit position $i$. Also note that Lemma 2.5 is an upper bound. A lower bound in terms of OR-rank is given in Thm 3.20. Moreover, the log of the OR-rank is exactly the nondeterministic communication complexity.

— Stasys

@Stasys Oh, yes, right!

— Alexander S. Kulikov

Answers:

Here is a partial (affirmative) answer for the case when we have an upper bound on the number of zeros in every row or in every column.

A rectangle is a boolean matrix consisting of one all-1 submatrix and having zeros elsewhere. The OR-rank $rk(A)$ of a boolean matrix $A$ is the smallest number $r$ of rectangles such that $A$ can be written as a (componentwise) OR of these rectangles. That is, every 1-entry of $A$ is a 1-entry in at least one of the rectangles, and every 0-entry of $A$ is a 0-entry in all rectangles. Note that $\log rk(A)$ is exactly the nondeterministic communication complexity of the matrix $A$ (where Alice gets rows and Bob columns). As the OP wrote, every boolean $m\times n$ matrix $A=(a_{i,j})$ defines a mapping $y=Ax$, where $y_i=\bigvee_{j=1}^na_{i,j}x_j$ for $i=1,\ldots,m$. That is, we take the matrix-vector product over the boolean semiring.
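As a small illustration of these definitions, the sketch below computes the Boolean matrix-vector product and checks a rectangle cover. The example is the matrix with zeros on the diagonal from the comments above, covered by rectangles indexed by a bit position and a bit value. All function names are my own, and the bit-labelling is one possible concrete reading of that comment.

```python
from typing import List, Set, Tuple

Rect = Tuple[Set[int], Set[int]]  # (row set, column set) of an all-1 submatrix


def bool_matvec(A: List[List[int]], x: List[int]) -> List[int]:
    """y_i = OR_j (a_ij AND x_j): the product over the Boolean semiring."""
    return [int(any(a & b for a, b in zip(row, x))) for row in A]


def or_of_rectangles(rects: List[Rect], m: int, n: int) -> List[List[int]]:
    """Componentwise OR of the given rectangles as an m x n 0/1 matrix."""
    M = [[0] * n for _ in range(m)]
    for rows, cols in rects:
        for i in rows:
            for j in cols:
                M[i][j] = 1
    return M


def off_diagonal_cover(n: int) -> List[Rect]:
    """Rectangles covering the n x n matrix with 0s on the diagonal and 1s elsewhere.

    For every bit position b and value a, take the rows whose b-th bit is a
    and the columns whose b-th bit is 1 - a. Two indices differ iff they
    differ in some bit, so exactly the off-diagonal entries are covered.
    """
    bits = max(1, (n - 1).bit_length())
    rects = []
    for b in range(bits):
        for a in (0, 1):
            rows = {r for r in range(n) if ((r >> b) & 1) == a}
            cols = {c for c in range(n) if ((c >> b) & 1) == 1 - a}
            rects.append((rows, cols))
    return rects
```

For $n=8$ this produces $2\cdot 3=6$ rectangles whose OR is exactly the off-diagonal matrix.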

The following lemma is due to Pudlák and Rödl; see Proposition 10.1 in this paper, or Lemma 2.5 in this book for a direct construction.

Lemma 1: For every boolean $n\times n$ matrix $A$, the mapping $y=Ax$ can be computed by an unbounded fanin OR-circuit of depth 3 using at most $O(rk(A)\cdot n/\log n)$ wires.

We also have the following upper bound on the OR-rank of dense matrices. The argument is a simple variation of the one used by Alon in this paper.

Lemma 2: If every column or every row of a boolean matrix $A$ contains at most $d$ zeros, then $rk(A)=O(d\ln|A|)$, where $|A|$ is the number of $1$s in $A$.

Proof: Construct a random all-$1$ submatrix $R$ by picking each row independently with the same probability $p=1/(d+1)$. Let $I$ be the resulting random subset of rows. Then let $R=I\times J$, where $J$ is the set of all columns of $A$ that have no zeros in the rows in $I$.

A $1$-entry $(i,j)$ of $A$ is covered by $R$ if $i$ was chosen in $I$ and none of the (at most $d$) rows with a $0$ in the $j$-th column was chosen in $I$. Hence, the entry $(i,j)$ is covered with probability at least $p(1-p)^{d}\geq pe^{-pd-p^2d}\geq p/e$. If we apply this procedure $r$ times to get $r$ rectangles, then the probability that $(i,j)$ is covered by none of these rectangles does not exceed $(1-p/e)^r\leq e^{-rp/e}$. By the union bound, the probability that some $1$-entry of $A$ remains uncovered is at most $|A|\cdot e^{-rp/e}$, which is smaller than $1$ for $r=O(d\ln|A|)$. $\Box$
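The random experiment in this proof is easy to simulate. The sketch below is an illustration only: the function name and the `seed` parameter are my own, and, unlike the proof, it keeps sampling until everything is covered and discards rectangles that cover nothing new, so it demonstrates the construction rather than the bound on $r$.

```python
import random
from typing import List, Set, Tuple

Rect = Tuple[Set[int], Set[int]]  # (row set I, column set J)


def random_rectangle_cover(A: List[List[int]], d: int, seed: int = 0) -> List[Rect]:
    """Cover the 1-entries of A by random all-1 rectangles I x J.

    Assumes every column of A has at most d zeros (the setting of the proof).
    Each round picks every row independently with probability 1/(d+1) and
    takes J to be the columns that have no zero in the chosen rows.
    """
    rng = random.Random(seed)
    m, n = len(A), len(A[0])
    p = 1.0 / (d + 1)
    uncovered = {(i, j) for i in range(m) for j in range(n) if A[i][j] == 1}
    rects: List[Rect] = []
    while uncovered:
        I = {i for i in range(m) if rng.random() < p}
        J = {j for j in range(n) if all(A[i][j] == 1 for i in I)}
        newly = {(i, j) for i in I for j in J} & uncovered
        if newly:  # keep only rectangles that cover something new
            rects.append((I, J))
            uncovered -= newly
    return rects
```

Every returned rectangle is all-1 by the choice of `J`, so the union of the rectangles is exactly the set of 1-entries of `A`.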

Corollary: If every column or every row of a boolean matrix $A$ contains at most $d$ zeros, then the mapping $y=Ax$ can be computed by an unbounded fanin OR-circuit of depth 3 using $O(dn)$ wires.

I guess that an upper bound similar to that in Lemma 2 should also hold when $d$ is the average number of zeros in a column (or in a row). It would be interesting to show this.

Remark (added 04.01.2018): An analogue $rk(A)=O(d^2\log n)$ of Lemma 2 also holds when $d$ is the maximum average number of zeros in a submatrix of $A$, where the average number of zeros in an $r\times s$ matrix is the total number of zeros divided by $s+r$. This follows from Theorem 2 in N. Eaton and V. Rödl, Graphs of small dimensions, Combinatorica 16(1) (1996) 59-85. A slightly worse upper bound $rk(A)=O(d^2\ln^2 n)$ can be derived directly from Lemma 2 as follows.

Lemma 3: Let $d\geq 1$. If every spanned subgraph of a bipartite graph $G$ has average degree $\leq d$, then $G$ can be written as a union $G=G_1\cup G_2$, where the maximum left degree of $G_1$ and the maximum right degree of $G_2$ are $\leq d$.

Proof: Induction on the number $n$ of vertices. The base cases $n=1$ and $n=2$ are obvious. For the induction step, we will color the edges blue and red so that the maximum left degree in the blue subgraph and the maximum right degree in the red subgraph are $\leq d$. Take a vertex $u$ of degree $\leq d$; such a vertex must exist because the average degree of the entire graph must also be $\leq d$. If $u$ belongs to the left part, then color all edges incident to $u$ blue; otherwise color all these edges red. If we remove the vertex $u$, then the average degree of the resulting graph is also at most $d$, and we can color the edges of this graph by the induction hypothesis. $\Box$
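The induction can be read as a peeling procedure. Here is a minimal sketch under my own assumptions: it always removes a vertex of minimum current degree (one way to pick the low-degree vertex $u$ from the proof), and it returns the largest degree `k` seen at removal time instead of assuming a bound $d$ in advance. The name `split_by_peeling` is hypothetical.

```python
from typing import Dict, Set, Tuple

Vertex = Tuple[str, int]  # ('L', index) or ('R', index)


def split_by_peeling(edges: Set[Tuple[int, int]]):
    """Split a bipartite graph into a blue part and a red part.

    edges holds pairs (l, r) with l a left vertex and r a right vertex.
    Repeatedly remove a vertex u of minimum current degree; the edges still
    incident to u become blue if u is on the left and red if u is on the right.
    Returns (blue, red, k), where k is the largest degree seen at removal time.
    """
    adj: Dict[Vertex, Set[Vertex]] = {}
    for l, r in edges:
        adj.setdefault(('L', l), set()).add(('R', r))
        adj.setdefault(('R', r), set()).add(('L', l))
    blue: Set[Tuple[int, int]] = set()
    red: Set[Tuple[int, int]] = set()
    k = 0
    while adj:
        u = min(adj, key=lambda v: len(adj[v]))
        k = max(k, len(adj[u]))
        for v in adj.pop(u):
            edge = (u[1], v[1]) if u[0] == 'L' else (v[1], u[1])
            (blue if u[0] == 'L' else red).add(edge)
            adj[v].discard(u)
    return blue, red, k
```

Each left vertex receives blue edges only at the moment it is removed, so its blue degree is at most `k`; symmetrically, each right vertex has red degree at most `k`.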

Lemma 4: Let $d\geq 1$. If the maximum average number of zeros in a boolean $n\times n$ matrix $A=(a_{i,j})$ is at most $d$, then $rk(A)=O(d^2\ln^2 n)$.

Proof: Consider the bipartite $n\times n$ graph $G$ with $(i,j)$ being an edge iff $a_{i,j}=0$. Then the maximum average degree of $G$ is at most $d$. By Lemma 3, we can write $G=G_1\cup G_2$, where the maximum degree of the vertices on the left part of $G_1$ and the maximum degree of the vertices on the right part of $G_2$ are $\leq d$. Let $A_1$ and $A_2$ be the complements of the adjacency matrices of $G_1$ and $G_2$. Hence, $A= A_1\land A_2$ is a componentwise AND of these matrices. The maximum number of zeros in every row of $A_1$ and in every column of $A_2$ is at most $d$. Since $rk(A)\leq rk(A_1)\cdot rk(A_2)$ (the componentwise AND of two rectangles is again a rectangle), Lemma 2 yields $rk(A)=O(d^2\ln^2 n)$. $\Box$

N.B. The following simple example (pointed out by Igor Sergeev) shows that my "guess" at the end of the answer was totally wrong: if we take $d=d(A)$ to be the average number of zeros in the entire matrix $A$ (not the maximum of averages over all submatrices), then Lemma 2 can badly fail. Let $m=\sqrt{n}$, put an identity $m\times m$ matrix in, say, the upper left corner of $A$, and fill the remaining entries with ones. Then $d(A)\leq m^2/2n < 1$ but $rk(A)\geq m$, which is exponentially larger than $\ln|A|$. Note, however, that the OR complexity of this matrix is very small: it is $O(n)$. So, direct arguments (not via rank) can yield much better upper bounds on the OR complexity of dense matrices.

— Stasys

Thanks a lot, Stasys! This is nice! In the meantime, Ivan Mihajlin came up with another proof. I've posted it below.

— Alexander S. Kulikov

(I tried to post this as a comment to Stasys' answer above, but this text is too long for a comment, so posting it as an answer.) Ivan Mihajlin (@ivmihajlin) came up with the following construction. Similarly to Stasys' proof, it works for the case when the maximum (rather than average) number of 0’s in each row is bounded.

First, consider the case when every row contains exactly two zeros. Consider the following undirected graph: the set of vertices is $[n]$; two nodes $i$ and $j$ are joined by an edge if there is a row having zeros in columns $i$ and $j$. The graph has $n$ edges and hence contains a cut $(L,R)$ of size at least $n/2$. This cut splits the columns of the matrix into two parts ($L$ and $R$). Let us now also split the rows into two parts: the top part $T$ contains all rows that have exactly one zero in both $L$ and $R$; the bottom part $B$ contains all the remaining rows. What is nice about the top part of the matrix ($T \times (L \cup R)$) is that it can be computed by $O(n)$ gates. For the bottom part, let's cut the all-1 columns out of it and make a recursive call. The corresponding recurrence relation is $C(n) \le an + C(n/2)$, implying $C(n)=O(n)$.
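The existence of a cut of size at least $n/2$ can be made constructive in several ways. The sketch below uses the standard greedy rule (place each column opposite to the majority of its already placed neighbours); this is my choice for illustration, not necessarily the method intended in the post, and the names `greedy_cut` and `split_rows` are hypothetical.

```python
from typing import List, Optional, Tuple


def greedy_cut(n_cols: int, zero_pairs: List[Tuple[int, int]]) -> List[int]:
    """zero_pairs[r] = (i, j): row r has its two zeros in columns i != j.

    Returns side[c] in {0, 1} (0 means L, 1 means R). Each pair is decided
    when its later column is placed, and at least half of the pairs decided
    at that step are cut, so at least half of all pairs are cut overall.
    """
    nbrs: List[List[int]] = [[] for _ in range(n_cols)]
    for i, j in zero_pairs:
        nbrs[i].append(j)
        nbrs[j].append(i)
    side: List[Optional[int]] = [None] * n_cols
    for c in range(n_cols):
        placed = [side[v] for v in nbrs[c] if side[v] is not None]
        # Put c opposite to the majority of its already placed neighbours.
        side[c] = 0 if placed.count(1) >= placed.count(0) else 1
    return side


def split_rows(zero_pairs: List[Tuple[int, int]], side: List[int]):
    """Top rows T have one zero on each side of the cut; bottom rows B do not."""
    top = [r for r, (i, j) in enumerate(zero_pairs) if side[i] != side[j]]
    bottom = [r for r, (i, j) in enumerate(zero_pairs) if side[i] == side[j]]
    return top, bottom
```

One plausible reading of why the top part is cheap: restricted to the columns of $L$ (or of $R$), every row of $T$ has exactly one zero, so the prefix/suffix trick from the question applies to each side, and one extra OR per row combines the two halves.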

Now, generalize it to the case of at most $d$ zeros in every row. Let $C_d(n)$ be the complexity of an $n \times (\le dn)$ matrix with at most $d$ zeros per row (if there are more than $dn$ columns, then some of them are all-1). Partition the columns into two parts $L$ and $R$ such that at least $n(1-2^{-d})$ rows (call them $T$) satisfy the following property: if there are exactly $d$ zeroes in a row, then not all of them belong to the same part (denote the remaining rows by $B$). Then make three recursive calls: $T \times L$, $T \times R$, and $B \times (L \cup R)$. This gives a recurrence relation $C_d(n) \le an + 2\cdot C_{d-1}(n(1-2^{-d}))+C_d(2^{-d}n)$. This, in turn, implies that $C_d(n) \le f(d)\cdot n$. The function $f(d)$ is exponential, but still.
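A quick sanity check of the last step, taking the recurrence exactly as stated and treating it as a sketch rather than a full proof. Assume inductively that $C_{d-1}(m)\le f(d-1)\cdot m$ for all $m$ and that $C_d(m)\le f(d)\cdot m$ for all $m<n$. The particular choice of $f$ below is my own illustration.

$$
\begin{aligned}
C_d(n) &\le an + 2f(d-1)\,(1-2^{-d})\,n + f(d)\,2^{-d}\,n,\\
\text{so } C_d(n)\le f(d)\,n \quad&\text{whenever}\quad f(d)\,(1-2^{-d}) \ge a + 2f(d-1)\,(1-2^{-d}),\\
\text{i.e. whenever}\quad f(d) &\ge 2f(d-1) + \frac{a}{1-2^{-d}}.
\end{aligned}
$$

Since $1-2^{-d}\ge 1/2$ for $d\ge 1$, it suffices to take $f(d)=2f(d-1)+2a$, which solves to $f(d)=2^{d-1}\bigl(f(1)+2a\bigr)-2a=O(2^d)$. This agrees with the remark that $f(d)$ is exponential in $d$.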

— Alexander S. Kulikov

A nice argument. But it seems to be tailor made for the case of d=2 zeros per row. What about d>2 zeros?

— Stasys

@Stasys, it is doable if I'm not mistaken. I've updated the answer.

— Alexander S. Kulikov