มี DFA กี่ตัวที่ยอมรับสตริงที่กำหนดสองชุด

แก้ไขจำนวนเต็ม $n$ และตัวอักษร $\Sigma=\{0,1\}$ }กำหนด $DFA(n)$ ให้เป็นคอลเลกชันของออโตมาตา จำกัด ทั้งหมดใน $n$ รัฐด้วยสถานะเริ่มต้น 1 เรากำลังพิจารณาDFAs ทั้งหมด (ไม่ใช่แค่การเชื่อมต่อที่น้อยที่สุด จึง $|DFA(n)| = n^{2n}2^n$ n

ตอนนี้พิจารณาสองสตริงและกำหนดให้เป็นจำนวนขององค์ประกอบของที่ยอมรับทั้งและy ที่ $x,y\in\Sigma^*$ $K(x,y)$ $DFA(n)$ $x$ $y$

คำถาม: ความซับซ้อนของการคำนวณคืออะไร? $K(x,y)$

คำถามนี้มีผลกระทบต่อการเรียนรู้เครื่อง

แก้ไข: ตอนนี้มีคำถามมากมายเกี่ยวกับคำถามนี้ฉันคิดว่ามีความแม่นยำมากขึ้นในการกำหนดสูตร สำหรับให้เป็นชุดของออโตมาตะตามที่กำหนดไว้ข้างต้น สำหรับ , ให้นิยามให้เป็นจำนวนออโตมาตาในที่ยอมรับทั้งคู่ $n\ge1$ $DFA(n)$ $n^{2n}2^n$ $x,y\in\{0,1\}^*$ $K_n(x,y)$ $DFA(n)$ และy ที่คำถาม: สามารถคำนวณได้ในเวลาหรือไม่ $x$ $y$ $K_n(x,y)$ $poly(n,|x|,|y|)$

— Aryeh
แหล่งที่มา

หากคุณแก้ไข DFA โดยไม่แก้ไขสถานะสุดท้ายให้ระบุแมป x และ y ให้อยู่ในสถานะเดียวกันซึ่งในกรณีนี้ข้อ จำกัด เพียงอย่างเดียวคือว่ารัฐจะต้องเป็นครั้งสุดท้ายหรือจะจับคู่กับสถานะอื่นสองสถานะ ข้อ จำกัด เพียงอย่างเดียวคือพวกเขาทั้งคู่จะต้องเป็นที่สิ้นสุด ดังนั้นฉันจะยืนยันปัญหาของคุณอีกว่า "มี DFA จำนวนเท่าใดที่เชื่อมโยง x และ y ไปยังสถานะที่แตกต่างกัน"

— a3nm

Aryeh คุณช่วยอธิบายการนับ

ไหม? ฉันไม่สามารถรับปัจจัย

ได้ เพิ่ม: โอ๊ะฉันลืมระบุสถานะสุดท้าย อย่างไรก็ตามเพื่อประโยชน์ของผู้อื่นนี่คือวิธีการนับไป สำหรับแต่ละรัฐระบุว่าจะไปที่อินพุต

และ

; ว่าบัญชีสำหรับ

ระบุชุดสถานะสุดท้าย; ว่า

n2n2n $n^{2n} 2^n$

2n $2^n$

0 $0$

1 $1$

n2n $n^{2n}$

2n $2^n$

— Srivatsan Narayanan

อันที่จริงผมไม่สนใจสิ่งที่เกิดขึ้นกับสายอื่น ๆ กว่า

และy ที่

ฉันเดาว่าต้องการคะแนนจำนวนหนึ่งเพื่อเริ่มรับรางวัลหรือไม่ x $x$

y $y$

— Aryeh

ออโตเมติกที่เล็กที่สุดที่รับ

และ

มีสถานะเดียวดังนั้นฉันจึงไม่คิดว่ามันจะให้ข้อมูลที่น่ากลัว ...x $x$

y $y$

— Aryeh

นี่คือความคิด: เราจะต้องทราบหมายเลขของ

DFAs -state ซึ่งสิ้นสุดในรัฐเดียวกันกับ

และy ที่

ขอหมายเลขนี้จะเป็น

และ

เป็นจำนวน DFAs คือ

จากนั้นคำตอบคือ

n $n$

x $x$

y $y$

m $m$

M $M$

M=n2n2n $M=n^{2n}2^{n}$

นี่ให้ขอบเขต เพื่อคำนวณ

คิดอื่นคือการที่เราสามารถลืมเกี่ยวกับส่วนที่เริ่มต้นร่วมกันของ

และ

และยังคิดว่า WLOG

และ

ข

เราเท่านั้นที่จะนับจำนวนของ DABs ความไบนารีกับ

รัฐและความสูงที่มากที่สุด

ที่

และ

ท้ายขึ้นมาในสถานที่เดียวกันและจากว่ามันเป็นเรื่องง่ายที่จะคำนวณเมตร12m+14(M−m) $\frac{1}{2}m + \frac{1}{4}(M-m)$

m $m$

x $x$

y $y$

x=0a $x=0^a$

b=1b $b=1^b$

l $l$

max{a,b} $\max\{a,b\}$

0a $0^a$

1b $1^b$

m $m$

— Kaveh

คำตอบ:

ดังนั้นคำถามสั้น ๆ แต่น่าสนใจมาก ฉันคิดว่าอินพุตเป็นในเอกภาพและและเลขฐานสอง (หรือเรามีปัญหาตามที่ระบุโดยคำตอบของไค) $n$ $x$ $y$

First of all, if you are interested in knowing $K(x,y)$ approximately, then you can just generate a few random DFA's and this will give you (whp) a good approximation. (I wonder if this complexity class has a name.)

จากนั้นการรู้ว่าดูเหมือนว่าจะเป็นปัญหาที่ยากลำบาก ตามที่ระบุไว้ในความคิดเห็นโดย a3_nm และ Kaveh คำถามนั้นเทียบเท่ากับการกำหนดจำนวนออโตมาตะที่ $K(x,y)$ $x$ and $y$ go to the same state. I will denote the probability that they go to the same state by $p$ .

Update: Some of the things I wrote here were not true, now I fixed them.

มันเป็นเรื่องง่ายที่จะเห็นว่า nเรามีความเสมอภาคถ้าคือ 0 ทั้งหมดและเป็นศูนย์ทั้งหมดยกเว้นบิตสุดท้ายซึ่งคือ 1 มีกรณีอื่นอีกไหม? ฉันไม่รู้ ถ้าเช่นคือสตริงว่างและดังนั้น $p \ge 1/n$ $x$ $y$ $x$ $y=00$ n $p= \frac{n+1}{(n-1)n}$

To simplify the problem, I even started to think about what happens if $x$ and $y$ are unary. If both are at least $n$ and their difference is divisible by $n!$ , then $p=1$ . Is there a simple formula for the unary version?

— domotorp
แหล่งที่มา

I've clarified the problem -- a

poly(n,|x|,|y|) $poly(n,|x|,|y|)$ algorithm is desired (or a reduction from some known hard problem). The sampling approximation is employed in the paper where this kernel is introduced:portal.acm.org/citation.cfm?id=1577108

— Aryeh

As for the unary version: there are only polynomially many

n $n$ -state unary automata, so I would bet that there is a poly-time algorithm for computing

Kn(x,y) $K_n(x,y)$ for this case.

— Aryeh

Indeed, you are absolutely right that the unary version is computable. I still wonder how simple the formula is for a given x and y.

— domotorp

The reduction you have used is buggy: x and y may be accepted by the same automata and end in completely different states, in fact, they may share only the starting state in their paths, which is true for all strings.

— amnn

@amnn: It has been three years since I wrote this, but doesn't the third para of my answer explain why I deal only with ending in the same state?

— domotorp

I may very well be missing the point but you stated that $n$ is fixed, so all DFAs of that size could be considered precomputed and stored in an easily simulatable format. Compute $K$ as follows:

On input $x$ , $y$ where $x,y\in\Sigma^*$

store $x$ and $y$
initialize counter $c$ to $0$
for each of your $n^{2n} 2^n$ DFAs
a. simulate it on both words (this step is $\mathcal{O}(|xy|)$ )

b. increment $c$ if both simulation runs are accepting
output $c$

Altogether, the computation has linear complexity. The answer is quite different for $K(n,x,y)$ .

— Kai
แหล่งที่มา

Clearly trying all machines will work. Aryeh wants to know if there's, perhaps, a polynomial time algorithm or otherwise some hardness result.

— Lev Reyzin

Strictly speaking this is polynomial time in the input, if n is not part of the input, that is what Kai was saying. But the question is clearly different.

— domotorp

Oh I see. I don't think that's what he means by "fix

n $n$ ." I think the natural interpretation of the problem is one that doesn't trivialize it.

— Lev Reyzin

Right, thanks for pointing out the loophole, Kai. It's been fixed :)

— Aryeh